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We consider a recurrent Markov process which is an Ito semi-martingale. 
The Levy kernel describes the law of its jumps. Based on observations 
-^0; ^A, • • • , -^nA, we construct an estimator for the Levy kernel's density. 
^ ' We prove its consistency (as nA — ^ oo and A — > 0) and a central limit theo- 

rem. In the positive recurrent case, our estimator is asymptotically normal; 
in the null recurrent case, it is asymptotically mixed normal. Our estimator's 
rate of convergence equals the non-parametric minimax rate of smooth den- 
sity estimation. The asymptotic bias and variance are analogous to those of 
the classical Nadaraya- Watson estimator for conditional densities. Asymp- 
totic confidence intervals are provided. 
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O' 1. Introduction 

(N , 

Statistical inference for jumps in continuous-time models has received significant atten- 
tion in recent years. Due to their well-known tractability properties, a vast amount of 
^ . literature has been devoted to the class of processes with stationary and independent 

\ increments, called Levy processes. The law of their jumps is characterised by their Levy 

measure. Parametric inference for Levy measures has a long history. For recent devel- 
opments in non-parametric settings, we refer, for instance, to Comte and Genon-Catalot 
(2011); to Figueroa-Lopez (2011); to the special issue Gugushvili, Klaassen, and Spreij 
(2010), which contains a collection of interesting papers; to Neumann and ReiB (2009); 
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and to Ueltzhofer and Kliippelberg (2011). Ample references to previous literature can 
be found within the aforementioned. 

In this paper, we consider a Harris recurrent Markov process X which is an Ito semi- 
martingale. Such a process is a solution of some stochastic differential equation 

dXt = b{Xt)dt + a{Xt)dWt + f 6{Xt.,y)li\\s{x,.,y)\\>i}p{dt,dy) 

\ (1-1) 

+ j 5(Xt_,y)l{||5(x,_,j;)||<i}(p - q)(dt,dy), 

with coefficients 6, a and 5; the SDE is driven by some Wiener process W and some 
Poisson random measure p (with intensity measure q(dt,d?/) = dt® A(d?/)); Xt_ denotes 
the left-limit. The law of its jumps is more or less described by the kernel F where, for 
each x, the measure -F(x, ■) coincides with the image of the measure A under the map 
y I—)- S{x,y) restricted to the set {y : 6{x,y) ^ 0}. We call F the (canonical) Levy kernel 
of X. We assume that the measures F{x, d?/) admit a density y i-> /(x, y), and we aim 
for non-parametric estimation of the function (x, y) t— ^ /(x, y). 

On an equidistant time grid, we observe a sample Xo(co'), Xa(w), . . . ,X„a(i^) of the 
process; the jumps are latent. We study a kernel density estimator for /(x, y). We show 
its consistency as nA — j- oo and A — )■ under a smoothness hypothesis on the estimated 
density. In the ergodic case, we obtain asymptotic normality. In the null recurrent 
case, we impose a condition on the resolvent of the process which goes back to Darling 
and Kac (1957). Thereunder, we prove asymptotic mixed normality. We also provide 
a standardised version of our central limit theorem for the construction of asymptotic 
confidence intervals. 

Our results are comparable to those in classical non-parametric density estimation. 
In particular: Our estimator's asymptotic bias and variance resemble those of the 
Nadaraya-Watson estimator in classical conditional density estimation. Just as in the 
classical context, moreover, the bandwidth choice is crucial for our estimator's rate of 
convergence. We conjecture that, for instance, a cross-validation method applies here 
analogously; see Fan and Yim (2004) and Hall, Racine, and Li (2004). By an optimal 
choice, if A fast enough, the rate is t;(r2A)"i"2/[rf{«i+«2)+2aia2]^ where en > (resp., 
a2 > 0) stands for the smoothness of / as a function in x (resp., in y), and the function 
V plays the role of an information rate. In the ergodic case, v{t) = t; in the null recur- 
rent case with Darling-Kac's condition imposed, v(t) = t^£(t) for some < 5 < 1 and 
some slowly varying function i. We remark that, in the case ai = 02, our achieved rate 
w(nA)°i/(2ai+2d) equals the non-parametric minimax rate of smooth density estimation, 
related to the smoothness of / as a 2(i-dimensional function and with respect to v{nA). 

At the core of our statistical problem, we essentially have to study the case first, 
where the process is observed continuously in time and, in particular, all jumps are 
discerned. In this case, we can consider a more general class of quasi-left-continuous, 
strong Markov processes with cadlag sample paths than just Ito semi-martingales. For 
these, the law of their jumps is again described by their Levy kernel. We present a version 
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of our estimator which utihses that the sojourn time of certain sets and the jumps are 
observed. Under shghtly weaker assumptions, we prove the estimator's consistency and 
asymptotic (mixed) normahty. As these results are vahd for a quite general class of 
processes, we believe that they are of independent interest, not only as a benchmark for 
all possible estimators which are based on some discrete observation scheme. 

For discrete-time Markov chains, a related result is presented in Karlsen and Tj0stheim 
(2001). We are aware that our final steps of proof appear to be similar. We emphasise 
that the main difficulties in our context, however, come in two respects: on the one hand, 
from establishing an appropriate auxiliary framework where related methods apply; on 
the other hand, from the discrete observation scheme where our primary objects of 
interest - the jumps - are latent. 

For continuous-time Markov processes, apart from the Levy process case and as far 
as known to us, estimation of their Levy kernel has been confined to the special case 
of Markov step processes. For these, there is a one-to-one correspondence between 
the Levy kernel and the infinitesimal generator. Efficient non-parametric estimation of 
Markov step process models has been studied by Greenwood and Wefelmeyer (1994). 
They assume the mean holding times to be bounded, and the transition kernel to be 
uniformly ergodic. This excludes the null recurrent case. The work on parametric 
estimation of Markov step processes is more exhaustive. The null recurrent case has 
been studied, for instance, by Hopfner (1993). There, the process is observed up to a 
random stopping time such that a deterministic amount of information (or more) has 
been discerned. Local asymptotic normality is shown in various situations. With a 
slightly different aim, in contrast, Hopfner, Jacod, and Ladelli (1990) considers Markov 
step processes observed up to a deterministic time. Accordingly, the observed amount of 
information is random. Local asymptotic mixed normality (of statistical experiments) 
is shown under Darling-Kac's condition. Here, we utilise some of their results and 
methods. We improve upon the restrictions within the aforementioned literature: First 
and foremost, we do not restrict ourselves to Markov step processes. Second, we consider 
processes, null recurrent in the sense of Harris, in a non-parametric setting. Third, we 
address the infiuence of observations on a discrete time grid. 

We briefiy outline our paper. In section 2 we study the estimation of the Levy kernel 
based on discrete observations. Split into three subsections, we present the statistical 
problem with our standing assumptions; we give our estimator along with a bias cor- 
rection; and state our main results - the estimator's consistency and the central limit 
theorem. In section 3, we study the case where continuous-time observations are avail- 
able. This section is organised analogously to section 2. The corresponding proofs are in 
section 4. The proofs for our main results of section 2 are in section 5. Each proofs sec- 
tion comes with its own short outline at its beginning. Since we bring together potential 
theoretic aspects of Markov processes with functional and martingale limit theory, we 
put some of our technical considerations off to appendix A. 
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2. Estimation from high-frequency observations 

2. Density estimation of the Levy kernel from 
high-frequency observations 

2.1. Preliminaries and assumptions 

On the filtered probability space(s) (fi, (=^t)f>o, (P^)xg-b), let X = {Xt)t>Q be a 
Markovian Ito semi- martingale with values in Euclidean space E = (R'^,^'^), or a 
subset thereof, such that P'^(Xo = x) = 1 for all x. For n G N and A > 0, we observe 
Xo{oj) and the increments 

A^XH :=XfcAH-X(fc_i)AH fc = l,...,n. (2.1) 

We emphasise that the jumps of the process are latent. 

Throughout this paper, we use the following notation: We abbreviate E* := E\ {0}. 
We denote the Dirac measure at x by e^. For vr an (initial) probability on E, we denote 
the expectation w. r. t. the law P'" := /7r(dx)P'' by E'". For a > and A C E, in 
addition, C^^{A) denotes the class of all continuous functions on A which are [aj -times 
continuously differentiable such that every x E A has a neighbourhood on which the 
function's (partial) [aJ -derivatives are uniformly Holder of order a — \_a\. 

The characteristics (5, C, n) of X are absolutely continuous with respect to Lebesgue 
measure; there are mappings h : E E and c : E E ® E, and a kernel F on E with 
F{x, {0}) = such that 

Bt= f\{Xs)ds, Ct= f\{Xs)ds, and n{dt,dy) = dt ® F(Xt,dy). (2.2) 
Jo Jo 

The integer- valued random measure Z]{s:AXa^o} ^(s,AXs)(dt, dy) on ]R_|_ x ]R!^ is called the 
process's jump measure. The random measure n is its predictable compensator: For 
every Borel function g : E x E ^ R+, (inital) probability vr, and t > 0, we have 

g{X,^,AX,)l{xM} = / F{Xs,dy)g{X„y). (2.3) 

We call F the Levy kernel. It is unique outside a set of potential zero. We assume it 
admits a density (x, y) ^-^ f{x, y) which we want to estimate. 

Throughout, we work under the following technical hypothesis on the characteristics: 

2.1 Assumption. 

(i) The process X satisfies the following (linear) growth condition: There exists a 
constant ( < oo and a Levy measure F on E such that 

\\bix)\\<ai + \\x\\), ||c(x)|| <C(l + ||xf), and F(x, A) < (1 + ||x||)F(A) 

holds for all X E E and every Borel set A C E. We denote by /3 G [0, 2] some 
constant such that / F(dw)(||w||^ A 1)) < oo. 
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(ii) The Levy measure F admits a density / which is continuous on E*. 

(iii) There exists a constant ( < oo such that sup||^||>;^||2;||/(z) < (. ^ 

Remark. Apart from the growth condition, there is no assumption on b and c. Whether 
X is a weak or a strong solution of (1.1) is irrelevant to us. 

We impose assumptions on the recurrence of X and on the smoothness of /. To obtain 
consistency for our estimator below, we impose: 

2.2 Assumption. The process X is Harris recurrent: On E, there exists a cx-finite, 
invariant measure fi for X such that, for every Borel set A (1 E, we have 



2.3 Assumption. For some a > 0, the Levy kernel admits a density / G C^ci-^ ^ 
and the invariant measure from Assumption 2.2 admits a continuous density fi'. ^ 

To obtain a central limit theorem, we also impose: 

2.4 Assumption. The process X satisfies the following Darling-Kac condition: For 
some < 6 < 1, there exists a function v : R+ — > R+ - at infinity, regularly varying of 
index 6 - such that, for every /x-integrable g, 



Remark. In the positive recurrent case (that is, when /x is finite). Assumption 2.4 indeed 
is satisfied for 5 = 1 and with v{t) = t/fi{E). We refer the interested reader to Touati 
(1987) and to Hopfner and Locherbach (2003). 

2.5 Assumption. For some ai,a2 > 2, the Levy kernel admits a density / which is 
twice continuously differentiable on E x E* such that x i— f{x,y) G C^].{E) for all 
y G E*, and y i— )■ f{x,y) G C^l{E*) for all x G -E; and the invariant measure from 
Assumption 2.2 admits a continuous density /x' which is {\ai \ — l)-times continuously 
differentiable. o 

Example. Suppose that / is bounded and vanishes outside {||x|| < l,||y|| < 1}; that 
is, there are neither jumps with left-limit outside the unit ball nor jumps of size bigger 
than one. Then our process's recurrence (or transience) is completely determined by 
drift and volatility. For instance: 




o 




(2.4) 
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(i) If the volatility a vanishes everywhere and the drift satisfies b{x) = —x, then X is 
positive recurrent. 

(ii) If the drift b vanishes everywhere, and the volatility satisfies cr(x) = 1, then X is 
not positive. In fact, X has the recurrence (or transience) of Brownian motion: 
In the univariate case, X is null recurrent and Darling-Kac's condition holds with 
5 = 1/2; in the bivariate case, X is null recurrent and Darling-Kac's condition 
fails; and in all other multivariate cases, X is transient. 

2.2. Kernel density estimator 

In principle, we are free to choose our favourite estimation method, e. g., the method of 
sieves with projection estimators. Here, however, we introduce a kernel density estimator 
as it allows for a more comprehensible presentation of the proofs. Also, the method is 
well- understood in the context of classical (conditional) density estimation. 

An outline: First, we choose smooth kernels gi and g2 with support -Bi(O) (the unit 
ball centred at zero) which are, at least, of order ai and a2, respectively; that is, for 
every multi-index m = (mi, . . . , m^) G N*^ \ {0} and each i G {1,2}, we have 

|m| := mi + ■ ■ ■ + rrid < =^ Umigi) '■= J x^'^gi{x)dx = 0. (2.5) 

Second, we choose a bandwidth vector 77 = (771,772) > 0. Last, we construct an estimator 
for f{x,y) using the kernels g^'^{z) := r]~'^gi{{z — x)/r]i). If the bandwidth is chosen 
appropriately, we achieve a consistent estimator which follows a central limit theorem. 

2.7 Definition. For 77 = (7/1,7/2) > 0, we call /^''' defined by 



the kernel density estimator of / (w. r. t. bandwidth 7/ based on Xq, Xa, . . . , X„a)- o 

In analogy to classical conditional density estimation, we also introduce a bias correc- 
tion for our estimator. 

2.8 Definition. For 7/ = (7/1,7/2) > 0, we call 7^'^ defined by 








(2.6) 




-r»i+m2(fi'l) I^fc=l ^;^5'l''^(^(fc-l)A) d"^'^ -A 



the bias correction for f^'^- (The sums in the previous equation are over all multi-indices 
of appropriate length.) o 
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2.3. Consistency and central limit theorem 

Here, we present our main results. We agree to the following conventions: Under As- 
sumptions 2.2 and 2.4, v denotes the regularly varying function given in (2.4). Under 
Assumption 2.2 only, v denotes an arbitrary deterministic equivalent (see Definition 4.1 
below) of the Markov process X. For typographical reasons, we may write Vt for v{t) 
or X{t) for Xt etc. as convenient. 

We utilise the following conditions as nA — > oo and A — )■ 0, where < Ci, C2 < oo: 

VnAvinVin and ?7i,„ 0, r]2,n -5- 0; (2.7) 

VnAvtn'^'vin^Cl and t^.A^^^^"^ ^ Cai (2.8) 

In addition, we also utilise the following conditions due to discretisation, where ( < 00 
is independent of n: 

^^-2-.[{i-2/(,+.))vo] _^ and Ar/^-f^^+'U 0; (2.9a) 
nAX^vin<C ..aA^<--^[(^-/('^^^)^°1<„^0, (2.9b) 
and v^^AXnVV'''^'^'^ ^ 0. (2.9c) 

Remark. If A — ?■ fast enough, then (2.7) and (2.8) are the crucial conditions. 

2.9 Theorem. Grant Assumptions 2.1 to 2.3. Let rjn = {^i,n-,f]2,n) he such that (2.7) 
and (2.9a) hold. Moreover, let {x,y) & E x E* be such that fi'{x) > and F{x,E) > 0. 

(i) If nA"^ — ^ 0, then, under any law IP^ , we have the following convergence in proba- 
bility: 

fj'^-{x,y)^f{x,y). (2.10) 

(a) Grant Assumption 2.4 in addition. If (nAY^^A — )■ 0, then, under any law IP^ , 
(2.10) holds as well. 

Remark. By this theorem, our estimator is consistent for every x and y 7^ if nA — )■ 00 
and A — )■ 0. In practice, however, both n and A are given! Then, for instance, if a 
continuous martingale component is present, our estimator is unreliable for all y close to 
the origin. To illustrate this important point, suppose that X is a univariate process with 
constant volatility o"^ > 0. Increments with absolute value less than (^cxA^/^, where C, is 
quite a large constant (e. g., C = 5), are predominantly due to the continuous martingale 
and not due to jumps. On the set {y : \y\ < CcrA^/^}, therefore, our estimator f^'^{x, ■) 
is unreliable regardless of the chosen bandwidth rj. 
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For the next theorem, we estabhsh additional notation. For < a < 1, let denote 
the a-stable Levy subordinator with Laplace transform Ee"^^' = e~*^" for C,,t >0. Its 
right inverse Lt '■= inf{s > : Ks > t} is called the Mittag-Lejfler process of order a. By 
abuse of notation, we call Lt = t the Mittag-Lejfler process of order 1. On an extension 

(fi, p) := {n xn',^(g) p'^ ® p') (2.11) 

of the probability space, let V = (y{x,y))x£E,y£E* be a standard Gaussian white noise 
random field (that is, the finite dimensional marginals of V are i. i. d. standard normal) 
and let L = {Lt)t>o be the Mittag-Leffler process of order 6 (from Assumption 2.4) such 
that V, L and ^ are independent. In the theorem below, convergence holds stably in 
law; that is, pre-limiting and limiting random variables are defined on the extended 
space (2.11) and we have joint convergence in law of our pre-limiting random variables 
with any bounded, ^-measurable random variable. This notion, labelled =5f— st, is due 
to Renyi (1963). 

2.10 Theorem. Grant Assumptions 2.1 to 2.5. Let rjn = {f]i,n,V2,n) be such that (2.7) 
and (2.9) hold, and let {xi,yi)i^j be a finite family of pairwise distinct points in E x E* 
such that fi'{xi) > and F{xi,E) > for each i E L If {n/S.Y~^ ^ — )• 0, then, under 
any law P'^, we have the following stable convergence in law: 



VnAVl,nV2 



where the asymptotic variance is given by 

^{x, yf := J g.iwfdw J g^izfdz. (2.12) 

In addition, let rjn be such that (2.8) holds as well. Suppose either that ai,a2 € N* 
or that Ci = C2 = in (2.8). Then, under any law P'^, we have the following stable 
convergence in law: 

(\Aw/f^^^(/^'''"(a^i, Vi) - f{xi, Vi))^ Vi) + '^^^^^j=^V{xi, Vi 



where - in the former case - the asymptotic bias '~f{x,y) is given by 



^(X v) = V ^mi+m2(^l) d""' // X ^""^ X 

' ^ ' \mi+m2\=cii ^ ^ 

l™2|^0 (2.13) 

|m|=a2 ^ 

and - in the latter case - 7(x,?/) = 0. 
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Remark. The asymptotic bias and variance of our estimator are analogous to those of 
the Nadaraya- Watson estimator in classical conditional density estimation: Kmidi) and 
/ gi{z)'^dz are the relevant moment and the roughness of the kernel Qi, respectively; and 
/ (resp., /i') plays the role of the conditional (resp., marginal) density. 

We recall that v from (2.4) satisfies Vt = t in the ergodic case, and Vt = tH{t) for 
some slowly varying function I in the null recurrent case. If we choose ?7j_„ = with 
^1 = Oi2/[d{ai + 02) + 2aia2] and ^2 = «i/[(i(ai + 02) + 2aiQ;2], then (2.7) and (2.8) 
hold with = (2 = 1- If A fast enough such that riAi+[^("i+°2)+2"i"2]/C _> in 
addition, where C, denotes the maximum of (1 — 5)d{ai + 02) + 2aia;2, 5ai{a2 + 2 + d) 
and (5a2(«i + 2 + (i^/(2 + d)), then our choice of also satisfies (2.9) for every /3 < 2. 
Consequently, our estimator's rate of convergence is 

ai02/[d(oi+a2)+2Q:ia2] (0^A\ 

In the case ai = 02, the achieved rate f"^^^"^^'^'' equals the non-parametric minimax 
rate of smooth density estimation, related to the smoothness of / as a 2(i-dimensional 
function and w. r. t. VnA- 

Remark. Bandwidth selection has always been a crucial issue in these kind of studies. 
Although orders of magnitude are crucial from an asymptotic point of view and rji^n = 
{nA)~^' for some > may be a good choice , we note that, in practice, rji n = C{nA)~^^ 
with leading constant C 7^ 1 could be a better one. A detailed analysis would go beyond 
the scope of this paper. We briefly comment on two problems: How to choose the 
bandwidths manually such that conditions (2.7-2.9) are satisfied for the unknown VnA, 
Qfi, a2 and (37 What needs to be considered when employing data-driven methods for 
selecting optimal bandwidths? 

(i) Let ao > 2 and < Sq < 1 such that Sq > d/{d + ao). If we choose rji^n = 
(nA)~-'^/(^'^+^°'''\ then (2.7) and (2.8) hold for all processes X such that Assump- 
tions 2.4 and 2.5 hold for some ai, 02 > and Sq < 6 < 1. IfA— >0 fast enough 
such that jT,Ai+2[°^o+'^]/[°o+{2+d)VQo] _^ g in addition, then our chosen bandwidth 
also satisfies (2.9). 

(ii) The asymptotic bias and variance are proportional to the value of / and its deriva- 
tives at the point of interest. The optimal bandwidth choice in terms of the asymp- 
totic mean squared error, therefore, may depend heavily on x and y. Especially 
for processes with infinite activity - where y 1— )■ f{x,y) has a pole at zero - this 
is an important issue in practice. In a future study on data-driven bandwidth 
selection methods like cross-validation, this distinction from estimating a bounded 
probability density has to be addressed carefully. 

Theorem 2.10 does not allow for a direct construction of confidence intervals. For this 
purpose, we also obtain the following standardised version. 



9 



3. Estimation from contimious-time observations 



2.11 Corollary. Grant Assumptions 2.1 to 2.5. Let r]n = if]i,niV2,n) be such that (2.7- 
2.9) hold. Suppose either that ai, a2 G N* or that Ci = C2 = in (2.8). Then under any 
law P'^, we have the following stable convergence in law: 



where = f gi{wf([w f g2{zfdz. 



in 



> (V{xi,yi) 



Remark. In principle, the results of this section are extendible to more general Markov 
models with Levy kernel F such that (2.3) holds. In view of our proofs, the assump- 
tion that X is an Ito semi-martingale is crucial for the analysis of the influence of 
discretisation (see section 5.1). Suppose that an explicit upper bound for the small-time 
asymptotic "error" 

^W[gr{A-X)]- lF{x,dw)gl 



w 



and an explicit sufficient condition which ensures 



sup 

s<l VnAVT 



d 



A /^n(X(fc_i)A) - / /in(X,,)dr 
fc=l •'^ 



-> 



for ^„ = 1 or = VnAVinVin available for some Markov process X. Then it is 
straightforward (see Lemma 5.7 and (5.32) — Lemmata 5.6, 5.9 and 5.10, respectively) 
to come up with sufficient conditions for Theorems 2.9 and 2.10, which replace (2.9). 



3. Density estimation of the Levy kernel from 
continuous-time observations — A benchmark 

The Levy kernel of a Markov process is related with jumps. In fact, our estimator (2.6) 
uses X(fc-i)A and A^X as proxies for the pre-jump value Xt- and the jump size AXt 
if, at a time t G [{k — 1)A, /cA], there is a jump from a neighbourhood of x and of 
size close to y. Eventually, such time intervals contain either zero or one such jump; 
never more. Certainly, the statistical analysis simplifies if we observed the whole path 
of X; introducing proxies would be useless. So, despite observing the whole path of X is 
somewhat unrealistic, it is theoretically important to study what happens in this case. 
This section is devoted to this question and can be viewed as a benchmark for what 
properties are achievable with a more realistic, discrete observation scheme. 



3.1. Preliminaries and assumptions 

On the filtered probability space(s) (f2, {^t)t>o, (P^)x£e), let X = {Xt)t>Q be a strong 
Markov process with values in Euclidean space E = (R!^ , ^J^'^) , or a subset thereof. Its 
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sample paths are supposed to be cadlag. We observe - continuously in time - one sample 
path {Xs{u) : s G [0,t]} for t > 0; in particular, we discern all jumps. 

In addition to the notation introduced before, we use some classical notation from 
Getoor (1975): We denote the shift semi-group on Q by {6t)t>o so that Xt+s = Xt o 9s 
for all s,t >0. We denote the transition semi-group of X on by {Pt)t>o- 

A (perfect homogeneous) additive functional H of X is an ^t-adapted process such 
that Ht^s = Ht o 9s + Hg for all s,t > 0. A Levy system [F, H) of X (in a wide sense) 
is a kernel F on E with -F(x, {0}) = and a non-decreasing additive functional H oi X 
such that, for every Borel function g : E x E ^ R_|_, probability vr on E, and t > 0, 

W g{Xs-,AXs)l{xM} = E'^ fdHs f F{Xs,dy)g{Xs,y). (3.1) 

0<s<i 

The disintegration into F and H is by no means unique. For an appropriate reference 
function go with Fgo{x) > 0, nevertheless, ratios of the form Fg{x)/FgQ{x) are unique 
outside a set of potential zero. In the cases where X is quasi-left-continuous (that is, 
when all jump times are totally inaccessible) Benveniste and Jacod (1973) proved the 
existence of a Levy system (F, H) where H is continuous. Such a process - cadlag, 
strong Markov, quasi-left-continuous - is called a Hunt process. 

Remark. The continuity of the additive functional was included as a part of the original 
definition of Levy systems due to Watanabe (1964). 

Throughout this section, we work under the following hypothesis: 

3.1 Assumption. There exists a Levy system [F, H) of X where Ht = t. o 

Recalling (2.3), we observe that all Markovian Ito semi-martingales satisfy Assump- 
tion 3.1. In analogy to the semi-martingale case, we call this F in Assumption 3.1 the 
(canonical) Levy kernel of X. It is unique outside a set of potential zero. Again, we 
assume it admits a density (x, y) h- )■ f{x,y) which we want to estimate. 

Compared to section 2, we slightly weaken the assumptions imposed on the smoothness 
of /. To obtain consistency for our estimator below, we impose Assumption 2.2 and: 

3.2 Assumption. The canonical Levy kernel admits a density /, continuous on Ex E*; 
and the invariant measure from Assumption 2.2 admits a continuous density fi'. o 

To obtain a central limit theorem, we also impose Assumption 2.4 and: 

3.3 Assumption. For some ai,a2 > 0, the canonical Levy kernel admits a density / 
such that X H- f{x, y) e C^oli^) all y G E*, and y f{x, y) G C^oli^*) fo'^ ^,11 x e E; 
and the invariant measure from Assumption 2.2 admits a continuous density /i' which is 
([tti] — l)-times continuously differentiable. o 
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3.2. Kernel density estimator 

In section 2.2, we introduced a kernel density estimator and its bias correction based 
on discrete observations. Here, we present corresponding versions which utihse the 
continuous-time observation scheme. We recall that gi and g2 are kernels with support 
5i(0) which are, at least, of order ai and a2, respectively. Given some bandwidth vector 
V = (?7i,?72) > 0, we utilise the kernels gi'^{z) = r]~'^gi{{z — x)/r]i). 

3.4 Definition. For r] = {r]i, 772) > 0, we call defined by 

[0 otherwise, 
the kernel density estimator of / (w. r. t. bandwidth 77 up to time t). o 

Our estimator in Definition 2.7 is the discretised analogue from the one presented 
here: In the numerator of the former, the jumps AXt and the pre-jump left-limits Xt- 
are replaced by the increments A'^X and the pre-increment values respectively. 
In the denominator, the sojourn time Jq gi'^{Xs)ds is replaced by its Riemann sum 
approximation A X]fc=i 5'i'^(-^(fc-i)A)- In analogy to Definition 2.8, we also introduce a 
bias correction for our estimator: 

3.5 Definition. For rj = (?7i,?72) > 0, we call 7^ defined by 



lt{x,y) := < 



|m2|7^0 

I m! oy"^ ai,a2 G N* 

otherwise, 



the 6ms correction for /; 



3.3. Consistency and central limit theorem 

Here, we present our results of this section. We continue to use the notation and con- 
ventions from section 2.3. 

We utilise the following conditions as t — oo, where < Ci, C2 < 00: 

vtvitVit and r/i,t 0, r/2,t ^ 0; (3.2) 

W,f"^<t^Ci, and W,<f"^^C2- (3.3) 
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3.6 Theorem. Grant Assumptions 2.2, 3.1 and 3.2. Let rjt = (?7i,t, 772,*) be such that 
(3.2) holds. Moreover, let {x,y) e E x E* be such that fi'{x) > and F{x,E) > 0. 
Then, under any law , we have the following convergence in probability: 

fy(x,y)^f{x,y). 

3.7 Theorem. Grant Assumptions 2.2, 2.4, 3.1 and 3.2. Let rjt = (?7i,t, ?72,t) be such 
that (3.2) holds. Moreover, let {xi,yi)i(zi be a finite family of pairwise distinct points in 
E X E* such that fi'{xi) > and F{xi, E) > for each i & I. Then, under any law , 
we have the following stable convergence in law: 



where the asymptotic variance is given by 

.2. fi^^y) 



(Tix, yY := J giiwYdw J g2{zYdz. (3.4) 

In addition, grant Assumption 3.3 and let rjt be such that (3.3) holds as well. Suppose 
either that ai,a2 ^ N* or that Ci = C2 = in (3.3). Then, under any law , we have 
the following stable convergence in law: 

(^\JvtvttVit{ft'ixi,yi) - fixi,yi))y^^ (lixi,y,) + V (xi, y^) j , (3.5) 



where - in the former case - the asymptotic bias '-f{x,y) is given by 
Li [x) I I rriilmol ox™-^ ox"^^ 

" ^ ' |mi+m2|=cn 

\m\=a2 ^ 

and - in the latter case - 'j{x,y) = 0. 

We compare Theorems 2.10 and 3.7. First, we remark that the asymptotic bias and 
variance of /^'^ are equal to those of our benchmark estimator fj^. Second, if we choose 
Vi,t = "^r^* with ^1 = a2/[d{ai + 0:2) + 2aia2] and ^2 = ai/[d{ai + 0:2) + 2aia2] again, 
then (3.2) and (3.3) hold with Ci = C2 = 1- The rate of convergence in Theorem 3.7 is 

^oi02/[rf{oi+a2)+2cna2]. 

the rates (2.14) and (3.7) are equivalent. Third, we observe that our remark on the issue 
of bandwidth selection holds analogously. Last, we note that Theorem 3.7 does not 
allow for a direct construction of confidence intervals just as Theorem 2.10. In analogy 
to Corollary 2.11, we also obtain the following standardised version. 
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3.8 Corollary. Grant Assumptions 2.2, 2.4 and 3.1 to 3.3. Let rjt = {f]i.t,V2,t) be such 
that (3.2) and (3.3) hold. Suppose either that ai,a2 e N* or that = (2 = in (3.3). 
Then under any law P'^, we have the following stable convergence in law: 



"^-^ {V{x^,y^). 
)-oo V / «€i 



where = J gi{wYdw J g2{zydz. 



4. Proofs for results of section 3 

The notion of a deterministic equivalent of a Markov process plays a crucial role in the 
limit theory for our estimator. 

4.1 Definition. A non-decreasing function v : 1R+ — ?■ 1R+ is called a deterministic 
equivalent of the Markov process X if the families 

{^{v{ty^Ht I P") : t > 0} and [^{v{t)Hf^ | P") : t > o} 

are tight for every probability n on E and every non-decreasing additive functional H 
of X with < WHi < 00. o 

We emphasise the following consequence of Theoreme 3 of Touati (1987): Under Darling- 
Kac's condition, the function v in (2.4) is a deterministic equivalent of X. For every 
H as in Definition 4.1, furthermore, we have that {v{t)~^Hst)s>o converges in law to a 
non-trivial process as t — ?■ oo. For Markov processes violating Darling-Kac's condition, 
the latter convergence may not hold. Nevertheless, Locherbach and Loukianova (2008) 
showed that some deterministic equivalent already exists when X is Harris recurrent. 

Throughout the proofs, we denote convergence of processes by double arrow ("^") and 
understand it as convergence on the relevant Skorokhod space. For instance, we denote 
by P(R'^) := P(R+; IRf') the space of all cadlag functions from R_|_ to R"' equipped with 
Skorokhod's topology. For a kernel F, a measurable function g, and a a-finite measure 
z/, the function Fg, the measure uF, and the number u{g) are given by 

Fg{x) := J F{x,dy)g{y), uF{A) := J u{dx)F{x,A), u{g) := j u{dx)g{x). 

A kernel F is called strong Feller if Fg is in the class of continuous functions for every 
bounded g. 

This section is organised as follows: First, in section 4.1 we prove a triangular ar- 
ray extension of Birkhoff's theorem for additive functionals. Second, in section 4.2 we 
introduce auxiliary Markov chains Z and Z' derived from our Markov process X. We 
show that our result from section 4.1 applies to these chains. Some technicalities are 
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put off to appendix A. Third, in section 4.3 we demonstrate a preliminary version of 
Theorem 3.6 which depends only on Z and Z'; we conclude with the final steps in the 
proof of consistency. Last, in section 4.4 we demonstrate a preliminary central limit 
theorem which depends only on Z and Z'; we conclude with the final steps in the proof 
of Theorem 3.7 and Corollary 3.8. 

4.1. An extension of Birkhoff's theorem 

The theorem presented in this subsection is the underlying key result for our proofs. It 
is a triangular array extension of Birkhoff's theorem for additive functionals (cf. Theo- 
reme II. 2 of Azema, Kaplan-Dufio, and Revuz, 1967). We prove a rather general version. 

4.2 Theorem. Let Z = {Zk)k€¥s* be a Markov chain with values in some state space D, 
with invariant probability tp, and with transition kernel . Assume that the state space 
is petite, that is, there exist a probability p on N* and a non-trivial measure Up on D 
such that, for every Borel set A C D, 

oo 

mi^Y.Pik)m\x,A)>u,{A). 

k=l 

Let (/?.„,)„gK* be a sequence of functions such that {'$hn)ne'3<}* is uniformly bounded. Let 
> be such that 

<n ^ oo, CV(^n)^c<oo, «^ ) " V( I | ) ^ and (nQ-^ipihl) ^ 

as n ^ oo. Then, under every law P'^ for some probability n on D, the following con- 
vergence holds uniformly on compacts in probability: 

G^^cs, where G: := — J2 h^{Zk). (4.1) 

^'^ k=l 

Remark. If (/i„)neN* is non-negative (resp., uniformly bounded), then nC,n — ^ oo and 
CV(/in) ^ c < oo already imply «^)"V(|/in|) (resp., (nO'^i^iK) ^ 0). 

Proof (of Theorem 4-^)- Convergence in probability is equivalent to the property that 
- given any subsequence - there exists a further subsequence which converges almost 
surely. By Proposition 17.1.6 of Meyn and Tweedie (1993), therefore, it is sufficient to 
prove this theorem under the law P''' only. 

For each s > and n G N*, we observe G" = + H'J^, where 

i7; = M|M and H': = ^j:{h^iZk)-^{h^)). 



15 



4. Proofs for results of section 3 



By assumption, we have H" — )■ sc uniformly in s as n — > oo. It remains to show that 
H'J^ converges to zero uniformly on compacts in probability. 

We note lE^[hn{Zk)] = il^^K) for every A;,n e N*; thus, ¥.^[H'p] = for all s > 0. 
Moreover, its second moment satisfies lEi'^[{H'J^)'^] = + K'J^, where 



[sn\ 



k: 



1 L'^'J 
?n k=l 



and 



2 lsn\-l lsn\ 
" Sn k=l '' l=k+l 



First, we note 



\k:\ < 



[snj 
n 



-> 0. 



(4.2) 



Second, let m e N* denote the period of Z. As the state space is petite w. r.t. \E', by 
Theorem 16.2.2 of Meyn and Tweedie (1993), the sampled chain with transition proba- 
bility is aperiodic and uniformly ergodic. By Theorem 5.4.4 of Meyn and Tweedie 
(1993), there exists a partition Dq, . . . , D^-i of the state space such that each Di is a 
recurrence class and such that the measures m%l){ ■ fl Di) are invariant w. r. t. For 
every i G {1, . . . , m} and z G -Dj, we denote j(/, z) := {i + /) mod m, where 'mod' stands 
for the modulo operator. For every hq G N*, we observe 



no 



1=1 k=0 1=1 

no mod m 

Hence, 



(4.3) 



no 
«=1 



fc=0 i=l 



By Theorem 16.2.1 of Meyn and Tweedie (1993), there exists a C < 1 such that, for 
every / = 1, . . . , m and each G N, 



sup 



km+l 



Consequently, 



(4.4) 



(4.5) 
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By (4.2) and (4.5), E'^[(i/f )2] 0, hence H'J" ^ in probability as n ^ oo. It remains 
to show the local uniformity in s of this convergence. 

By (4.3) and (4.4), we have that /i„ — tp{hn) is in the range of (I — \1/). Let /i„ denote 
its pre-image under (I — \1/) (that is, its potential), and define the process by 

We note that M" is a ^"-martingale where := a{Zk : k < [sn\). Since (\l//i„)„g]N* 
is uniformly bounded by assumption, so is {'^hn)n£¥s*- As n — oo, therefore, we have 
\Hi^-M^\ = «„)-^|^/i„(^o) - ^/^n(^L-j)l ^ 0. Likewise, lE^HM^Y] < 21E^{H'J'Y + 
2E'^ - M,"|2 ^ 0. By Doob's inequality, therefore, ^ in ucp. Hence, also 
jj/n _^ g uniformly on compacts in probability as n — > oo. □ 



4.2. The auxiliary Markov chains 

In this subsection, we construct auxiliary Markov chains Z and Z' to which Theorem 4.2 
applies. Once and for all, we fix our points of interest, i.e., {{xi,yi) : i G /} of Theo- 
rem 3.7 such that n'{xi) > and F{xi, E) > for each i. Moreover, we choose a compact 
set C D {xi : i E 1} and constants <£,£:'< oo such that e < \\yi\\ < e' for alH G / 
and such that 

MF[x,{y:E< \\y\\ < s'}) > 0. (4.6) 



Remark. Under Assumptions 2.2 and 3.2, such a set C always exists by the choice of 
the points Xi and the continuity of f on E x E*. 

Let Ti,T2, . . . denote the successive times of jumps of size between e and e' starting 
from C; that is, 

Ti := inf {t > : £ < || AX^H < e\ G C} and T„+i := Ti o 9^^ + T„. 

The conditional expectation w. r. t. the strict past of the stopping times T„, plays a key 
role. We set 

: e < \\y\\ < e'})lc{x), 

{q~^{x)f{x,y), if X G C and e < \\y\\ < e', 
0, else. 

It is well-known that Ti < oo a. s. if, and only if, fi{q) > 0. In our case, this holds 
by (4.6). Therefore, T„ < oo a. s. for all n as well. For convenience, we abbreviate the 
kernel with density p by 11; its shifted version with density (x, y) i-> p{x,y — x) we denote 



q{x) := 
p{x,y) := 
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by n. By Weil (1971), 11 (resp., 11) is the conditional transition probability kernel of the 
jumps at the time(s) T„ in the following sense: On the set {T„ < oo}, for every random 
variable Y, measurable function g, and all x, we have 

W[g{AXTj I ^T„-] = %(Xt„_), (4.7) 
W[YoeT„ I ^T„-] = nE-[F](Xr„-). (4.8) 

We note ILE-[Y]{x) = Jp{x,y) E^'+^[F]d|/. 

Let D := V{[0,1[;E) x R+ x C. For every k G N*, we define the D-valued and 
C- valued random variables 

Zk '■= (s ^{i-s)Tfe_i+sTfe, Tk — Tfc_i, and Z^. := Xt^-. 

The corresponding filtration {^k)k£¥i* is given by := ^t^,-- We emphasise that we 
exclude time k = 0. From (4.8) and Ti < oo a. s., we deduce that Z = (Zfc)fcgN* and 
Z' = (Z^)fcg]N. are ^ffe-Markov chains. We denote their transition probabilities by and 
$, respectively. We refer to appendix A for technical results on these auxiliary Markov 
chains. 

4.3 Lemma. Let {g,t,x) G D, let A C C and A C D &e measurable, and let G N*. 
Then 

^x,A) = ILF-{Z[e A){x), (4.9) 
^''+\{g, t, x), A) = ^''■^{x, A). (4.10) 

Proof. We deduce (4.9) and (4.10) directly from (4.8) and the Markov property of X, 
respectively. □ 

By Lemma 4.3, Theorem 4.2 applies to Z' and, also, to Z. 

4.4 Lemma. Grant Assumptions 2.2 and 3.2. Then the Markov chain Z' is strong 
Feller. Its state space C is petite with respect to $. 

Proof. Let / be a bounded Borel function and Xq G C. Under Assumption 3.2, we deduce 
from Lebesgue's dominated convergence theorem that q is continuous. By (4.6), we have 
that X I—)- p{x,y) is also continuous for every y and svLp{p{x,y) : x & C,y E E} < oo. 
Again by Lebesgue's dominated convergence theorem, we conclude that 

lijn M.) = li.n /p(x. + y)Ay = /p(x„. + y)Ay = n,(x„). 

X — >Xq X — ^3^0 J J 

By (4.9), consequently, $ = nP (Z( G ■) is strong Feller on C. 

By the same argument as for the equivalence of Ti < oo a. s. and /i(g) > 0, we have that 
the measure with //-density q is an irreducibility measure of Z'. Under Assumption 2.2, it 
is absolutely continuous. Thus, its support has non-empty interior. By Theorem 6.2.5 (ii) 
of Meyn and Tweedie (1993), therefore, every compact set - hence the state space C of 
Z' - is petite with respect to $. □ 
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4.5 Corollary. Grant Assumptions 2.2 and 3.2. Then the state space D of Z is petite 
w. r.t. 

Proof. By Lemma 4.4, there exists a probability p on N* and a non-trivial measure Vp 
on C such that, for every Borel set A ^ C, 

oo 

inf ^p(A;)$'=(x,A)>z/,(A). 

""^^ k=i 

Let {g,t,x) G D, A C D be measurable, and p be the probability on N* given by 
p(l) = and p{k) = p{k - 1) for > 1. By (4.10), then 

oo oo 

J2m'^Hi9,t,x),A) = Y.p{k)^'<i^{x,A) > Up^HiA) =: UpiA). 

k=l k=l 

Since z/p is non-trival, so is Up. □ 

4.3. Proof of Theorem 3.6 

Throughout the remainder of section 4, we work under the law P'^ for some initial 
probability vr on and, for presentational purposes, we suppose w. 1. o. g. that p{q) = 1. 
We consider the processes G""'"^, J^'^ and S"^'^ given by 

G:''{^^y) ■■=-!: 9riXn-)9ri^xT,), (4.11) 
^ fc=i 

k=l 

We emphasise that these processes are of the form J2\:=i hn{Zk) where Z is the auxiliary 
Markov chain defined in section 4.2. We utilise the following preliminary condition as 
n^oo (cf., (3.2)): 

"^l,n^2,n and ?7l,n 0,?72,n 0. (4.13) 

4.6 Lemma. Grant Assumptions 2.2, 3.1 and 3.2. Let rjn = ?7i,n be such that (4-13) 
holds. Then the following convergences hold uniformly on compacts in probability: 



J^i^) - E anXr,^) and S:'\x) := - / gr{Xr)^r. (4.12) 



^ sq{x)p'{x) and S^-ix) ^ sp'{ 



Proof. Let t/' and (f denote the invariant probabilities of Z and Z', respectively. We 
apply Theorem 4.2: 

(i) We note that J"'''"(x) is of the form (4.1) with ^„ = 77^ and /i„ : C — ?■ R given by 
hn{z) = gi{{z—x)/rin); (/in)neN* is Uniformly bouudcd. By Corollary A. 6 where /i(g) = 1, 
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q is the /i-density of ip. Also q and are continuous. By Lebesgue's differentiation 
theorem, thus, 

V(^n) = Vn'^ J Kdz)q{z) gi{{z - x) / 7]^) q{x)fi'{x). 

Since nr]^ — )• oo, hkewise, {n'r]'^''')~^(p{\hn\) — )• as n — )• cxd. ^ 

(ii) We note that S'"'''"(x) is of form (4.1) with = 77^ and /;,„ : D — R given by 
hn{g,t,z) = t Jq gi{{g{s) — x)/rin)ds. By Corollary A. 6, = cp'^. By Lemmata A. 2 
and A. 5, thus, 

Vn'^^iK) = Vn'^ J f^idz)g^{{z - x)/7]n) 

Likewise, (n?7^'^)~V(l^n|) < {'mi^Y^ S ii{dz)\gi{{z — x) / 'qn)\ — )■ 0. By Corollary A. 4, in 
addition, we observe 

^{hl) < 2||^i||oo S ji{dz)\gi{{z-x)/7]n)\ ^ Q 



4.7 Lemma. Grant Assumptions 2.2, 3.1 and 3.2. Let rjn = {rii^n,V2,n) be such that 
(4-13) holds. Then the following convergence holds uniformly on compacts in probability: 



Gy''ix,y)^.sf{x,y)j^'{x). 



Proof. Let (=^")s>o be the filtration given by J^J" := ^t^^^j+i-- By (4.7), we have 
E[AG^'''" I jejl] = gf''''{Z'k)Iigf'y{Zl) for s = k/n. Thus, "the compensator of G"'''" 

w. r. t. {M'r)s>o is given by H^'^- := eS ^??"'^(4)%2'"''(^D- 

Fix s > 0. In analogy to the proof of Lemma 4.4, 115^2"'^ is continuous under Assump- 
tion 3.2. In analogy to Lemma 4.6, Z^itTi \9i""^{^k) I converges in ucp to a non-trivial 
process as n — 00. Therefore, 

_ Ugr''{x)J:'^-{x)\ < sup \Ilgr''{z) - Ilgr''{x)\ ■ - E Ig^^iZDl ^ 0. 

Since p is continuous under Assumption 3.2, lim„_>.oo 11(72 "'^(x) = Pi^^v) by Lebesgue's 
differentiation theorem. We recall f{x,y) = q{x)p{x,y). By Lemma 4.6, hence. 



It remains to prove M" := G" — if" uniformly on compacts in probability. By 
(4.13), we have supj| AM,"||oo < inr]i'&'^)-^\\gi\\oo\\92\\oc ^ 0. By Theorem VI IL 3. 3 3 of 
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Jacod and Shiryaev (2003), thus, it is sufficient to show that the predictable quadratic 
variation (M", M")^ of converges in probabihty to zero for all s. We observe 



^- k=l 



1 1 Vsn\ 

< -^r^ ■ -T^vingr^'iZir / ^ p{Ziy + r^,,^z)g,{zYdz. 



nrjf. 



In analogy to Lemma 4.6 again, Si^i ndT"^ {Z'^^Y converges in ucp to a non-trivial 
process as n — oo. As in the proof of Lemma 4.4, moreover, p is bounded on C x i?. 
Consequently, (M", M"')^ — )■ in probability as n — )■ oo. □ 

Next, we carry Lemmata 4.6 and 4.7 over to the time-scale of X. Let J be the process 
given by 

oo 

Jt:=Y.Mt]iTk). (4.14) 

k=l 

We note that J is a non-decreasing additive functional of X. It is the random clock of Z 
(and Z') in terms of X. By (3.1) - where Ht = t and by /i(g) = 1, we have lEi^Jt = t 
for all t > 0. 

4.8 Lemma. Grant Assumptions 2.2, 3.1 and 3.2. Let v : ]R+ — ^ R+ denote a deter- 
ministic equivalent of X , and let rjt and {x,y) & E x E* be as in Theorem 3.6. Then 

the family {G^^ (x, y), 8]% (x) | P^) : t > o} is tight. (4.15) 

Moreover, each limit point of the family in (4- 15) is the law J^{f{x,y)fi'{x)L,fi'{x)L) 
for some positive random variable L. 

Proof. As J is a non- decreasing additive functional of X, by Locherbach and Loukianova 
(2008), the families {^{Jt/vt | P'^) : t > 0} and {^{vt/Jt | P'') : t > 0} are tight. By 
Corollary VI. 3. 33 of Jacod and Shiryaev (2003) and Lemma 4.7, thus, 

the family {^(G^'*'''*(x, y), S'"''^*{x), Jt/vt, Vt/Jt | P") : t > 0} is tight. (4.16) 

Let Q denote a limit point of the family in (4.16), and let {tn)n£¥! a sequence such that 

^(G^'*"'''*"(x,2/),5'''"'''*"(x), J,„M„,^;,JJ,„ I P-) -^Q. 

On some extension of the probability space, w. l.o.g., there exists a random variable 
L > such that Q = =Sf(s i— sf{x,y)fi'{x), s i— s/i'(x), L, l/L). Since its ffist and 
second marginal are the laws of continuous processes, we have 
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Proof (of Theorem 3.6). For every t >0 and each x and y, we have 



f?'{x,y) 



Let /in : D — )■ R be given by hn{g, t, z) := t Jq \gi"'^{g{s))\ds. By Lemma A. 2 and Corol- 
laries A.4 and A.6, we have ^{hl) < 2\\gi\\ooVi,i{i^iz€C q{z))~^fr{\gi"'''\). By Markov's 
inequahty, since v^rjf^ — )■ oo, therefore, 

v^' f gr\XMs < ik'K,{Zj,+,) 0. (4.17) 

By Proposition 17.1.6 of Meyn and Tweedie (1993), in analogy to the proof of Theo- 
rem 4.2, this convergence in probability holds under every law P'^. 

We recall the results from Lemma 4.8. Let L > be a random variable such that 
the law j!f{f{x,y)^'{x)L,fi'{x)L) is a limit point of the family in (4.15). Moreover, let 
(^n)neN* be a sequence such that 



{fix,yMx)L,^^'ix)L 

We recall fi'{x) > 0. Consequently, ft^"{x, y) — )• /(x, y) in law as n — ^ oo by the contin- 
uous mapping theorem. As this limit is unique and independent of the particular limit 
point of the family in (4.15), we have that //"(x, y) converges to f{x,y) in law, hence, 
in probability. □ 

4.4. Proofs of Theorem 3.7 and Corollary 3.8 

In this subsection, we work on the extended space (2.11), L denotes the Mittag-Leffler 
process of order < 5 < 1, and W = {W^)i<zi denotes an /-dimensional standard Wiener 
process such that L, W and ^ are independent. 

In addition to the processes G"''^, J^'^ and 5""''' given in (4.11) and (4.12), we consider 
the process f/"''' given by 

U:'\x, y) := {g:'\x, y) - ^^^Jfff ^"''(^)) " (^-l^) 

We emphasise again that these processes are of the form Y}k=i hn{Zk) where Z is the 
auxiliary Markov chain defined in section 4.2. 

4.9 Lemma. Grant Assumptions 2.2, 2.4, 3.1 and 3.2. Letrjn = {r]i,n,V2,n) be such that 
(4.13) holds. Then we have the following convergence in law in Vi^Bj): 



(f/r*(x„2/.))^^, ^ {^^\xi)a{x,,y;)Wl 
where a^Xjy)"^ is given by (3.4). 
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Proof. For n G N*, let M"''' be the process given by 
M:'^{x,y) := 



n 



k=l 



Tfe_i 



and let {JifJ')s>o be given by JifJ^ := ^^t^^^j • By Theorem VIII.3.33 of Jacod and 
Shiryaev (2003), it is sufficient to prove (i)-(iv) as follows: 

(i) We have f/"'''" (a;, y) - M"'''"(x, ^ in ucp as n cx). 

(ii) The process M"''' is an J^"-martingale for each n. 

(iii) For alH, j G /, we have 



(iv) We have the "conditional Lyapunov condition' 



k=l 



0. 



(i) We note that U^'^{x,y) — M^''^{x,y) is of form (4.1) with /i„ : D — )■ R given by 

M.,t,.)^t^ ..(^j (F,r^(.(s))- ^^^;^^,„g- ^ Jd., 

and = fh^nVirl'^^^^^'^ ■ By Lemmata A. 2 and A. 5 and Corollary A. 6, we have 



Since ?72,„ — 0, we also observe 

^{\hn\) 



<<n\^^i\9r''Fgry\)+^^{\gr'^\). 



-> 0. 



By Corollary A. 4, likewise, 



< 



fi{\gr'^Fgr''\)+K\9r'i) 



<n inf^6C'g(2;^ 
Since n^„ — oo, we deduce from Theorem 4.2 that (i) holds. 



1 J 



-> 0. 
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(a) By construction, M"-'"^ is integrable and adapted to (^"')s>o- For s = k/n, we 
note JifJ^ = c^Tfc_i- By (3.1) - where Ht = t - the compensator of our process's jump 
measure is given by dt ® F{Xt, dy). By Doob's optional samphng theorem, thus. 



Tk-i 



gnX,)FgnXs)ds 







for all k G N*. Therefore, M"'''^{x,y) is an ^^"-martingale. 
(Hi) Let i,j G /. In analogy to step (ii), we deduce 



M' 



{xi,yi),M''^''"{xj,y,) 



/i,n h,n ^ gr^^gr'''^{Z',)gr^^gr'''{AXTj ^t,_, 



n 



k=l 







For all n large enough, we have g1""^'gi"" ^ = whenever Xi ^ Xj, and g2"'^'g2 
whenever yi ^ yj. For all u, ifi j, thus, {M"'''^"{xi,yi), M"-''^'^{xj,yj))s 0. 

Moreover, let Jf'''"(x) := n-^r/f „ eS E^^^-i [^^(^D^]- We note that J"^'''" is of 
form (4.1) within = vL and /i„ : D ^ R given by hn{g,t, z) = E^^^^ [(7i((Z( - x)/7]i,n?]. 
By Lemma A. 5 and Corollary A. 6 and under Assumption 3.2, we observe 

ri^^^ijj{hn) = / fi'{x + r]i^nz)q{x + r]i^nz)gi{z)'^dz > ^\x)q{x) / gi{zfdz. 



(4.19) 



By Theorem 4.2, since hn is non-negative and uniformly bounded, thus. 



X) 



ucp 



sq{x)iJ,'{x) / gi{zfdz. 



Hence, we observe 



M"*(x,y),M"'''"(x,?/)) - Jr'^"(x)p(x,y) g2{wYdw 



< J'^ '^"(x) / g2{w) dw sup p{x + rii^nZ,y + ri2,n'w) -p{x,y) 

J z,weBi{0) 

Since f{x,y) = q{x)p{x,y), consequently. 



0. 



M"*(x,2/),M"*(x,?/) 



> sf{x,y)fJ^'{x) / 5(i(tf;) dw; / 5(2(2;) dz; 



that is, (iii) holds. 

(iv) We observe |K'"'''"(a;, ?/)| < K'J'''^" + K'^"'''^", where 



4 2d 2d [sn] 



k=l 



griZ[)griAXT:, 
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and 



A^2d 2d \_sn\ 



EE 



k=l 



Ti 



grFgnXs)ds 



We note that K'^'^" and K""^'"^" are of form (4.1) with ^„ = niql^rjl^/A and, respectively, 
K{g,t, z) = [g,{{Z[ - x)/r/i,„)^^72((AXT, - i/)/r/2,n)'^ , 

and 

'w-y\\ 



/i„(^7,t,z) = E^(°) 



Ti 



9i 



By Lemma A. 5 and Corollary A. 6, for K'"^, we have 



nvinVin 



fi'{x + 'r]i^nz)gi{z) f{x + rii^nZ,y + r]2,nw)g2{w) dwdz 



-> 0. 



By Corollary A. 4 and Lemma A. 5, for K"^ moreover, there exists a C < cxo such that 



< 



4C 



nvtnVt 



+ r]i^nz)\gi{z)\f{x + r]i,nZ,y + r]2,nw)\g2{w)\dwdz 



0. 



Since, in both cases, /i„ is non-negative and uniformly bounded, we deduce from Theo- 
rem 4.2 that \K^''^"{x, y) < K'J'''^" + K"""'"^" ^ in ucp as oo. □ 

Next, we carry Lemma 4.9 over to the time-scale of X. We recall that the additive 
functional J of X, given in (4.14), is the random clock of Z (and Z') in terms of X. In 
addition, let L* denote the process given by L* := v^^Jst- 

Under Darling-Kac's condition, we have the important Theoreme 3 of Touati (1987) 
at hand; see also p. 119 of Hopfner et al. (1990) and Theorem 3.15 of Hopfner and 
Locherbach (2003). For reference, we include it as the following proposition. 

4.10 Proposition. Grant Assumptions 2.2 and 2.4- Let H = {H^,...,H'') be a fi- 
integrable additive functional of X with (component-wise) non- decreasing paths. Then, 
under every law P'^, we have the following convergence in law in V{R^): 



(^t ^Hst)s>0 



t—>oo 



(4.20) 



Recalling Lemma 4.6, by eq. (3.4) of Hopfner et al. (1990), we obtain the following 
corollary to Proposition 4.10. 

4.11 Corollary. Grant Assumptions 2.2, 2.4, 3.1 and 3.2. Let rjt = ?7i,t be such that 
(3.2) holds. Then we have the following convergence in law in r'(Ri+^).- 



('5'i,*'''*(xi) 



L, (^ij'{xi)L 
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4.12 Lemma. Grant Assumptions 2.2, 2.4, 3.1 and 3.2. Let rjt = {f]i,tiV2,t) be such 
that (3.2) holds. Then we have the following convergence in law in 'D[^^^): 

where a{x,y)^ is given by (3.4). 

Proof. From Corollary 4.11 and Lemma 4.9, we infer 

L* ^ L and (f/^'-'''(x„ y,))^^j ^ y^)W')^^^ . (4.21) 

Thus, the families 

{^(L* I P") : t > O} and {^((f/'^"'"(x„ y,)),^/ | P") : t > o} 

are C-tight. By Corollary VI.3. 33 of Jacod and Shiryaev (2003), we conclude that 

the family {^(l*, (?/'''■'" (x^, yi))i^i | P" ) : t > o} is C-tight. (4.22) 

In the remainder of this proof, we abbreviate U^^ := {U^*'^^{xi,yi))i^j. 

Let {n, #) := (V{R x R^), ^(R x R^)) denote the canonical space, and let (L, W) be 
the canonical process. Moreover, let P be an arbitrary limit point of the family in (4.22). 
We deduce from (4.21) that its marginals are given by the Mittag-Leffler law of order 6 
and the /-dimensional (scaled) Wiener law, respectively. For convenience, we abbreviate 
Qi := =Sf(L I P) and Q2 := ^{W \ P). Suppose that L and W are independent 
processes under P. Then P = Qi ® Q2 holds. As P is an arbitrary limit point of the 
family in (4.22), then it has to be unique. Hence, {^{{L\ IP") \ P'^) ^ Qi ® Q2 weakly 
as t — 00. o 

Let K denote the right-inverse of L, i.e., Kf := inf{s : Lg > t}, and let {Jift)t>Q be 
the filtration on Cl which is generated by the process {K, W). Suppose that - under 
P - K and W are processes with independent increments relative to {J^t)t>o- (That 
is, Kt+s — Kt and are independent for all s,t > 0, and Wt+s — Wt and are 
independent for all s,t > 0.) Then, in analogy to Step 6 on p. 122 of Hopfner et al. 
(1990), we deduce that - under P - the pair {K, W) itself is a process with independent 
increments relative to {J^t)t>o- We recall that K is a, (5-stable subordinator, thus, purely 
discontinuous (resp., deterministic if 5 = 1). Since W is continuous, hence, K and W 
are independent processes - under P. Consequently, P = Qi Q2. o 

It remains to show that - under P - K and W are processes with independent 
increments relative to {J^t)t>o- This, however, follows in analogy to Step? on pp. 123f 
of Hopfner et al. (1990) with obvious notation. □ 

Next, we demonstrate that the convergence in Lemma 4.12 holds stably in law. 
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4.13 Lemma. Grant Assumptions 2.2, 2.4, 3.1 and 3.2. Let rjt be as in Lemma 4-12. 
Then, we have the following stable convergence in law in 



L, (iJ,'{xi)a{xi,yi)Wi 



i&I 



where a{x,y)'^ is given by (3.4). 

Proof. Let /i be a bounded, Lipschitz continuous function on P(R^+^) and F be a 
bounded ^-measurable random variable. With a{x,yY given by (3.4), we abbreviate 



We have to demonstrate 



•^ii yi 



and W := (^^i\xi)a{xi,yi)W'' 



(4.23) 



First, we suppose that Y is ^^^-measurable for some u > 0. Let a be given by 
a* = (s — ut~^)^. Then a* converges to = s as t — > oo. By Lemma 4.12, since a* 
is non-random, ^(a*, L\ L?""* | P'") ^{a, L,W \P) weakly as t oo. The paths of 
the limit process are a. s. continuous. By eq. (3.4) of Hopfner et al. (1990), therefore, 

^{a\ Lit, W o L*, I P-) if (a, L, Wl\P). 

Since E"[/i(L*, o o L*0 o 0„)r] = E"[E^"[/i(L*,, t/^'* o by the Markov 

property, and since E^[E[/i(L, Wl)]Y] = E[/i(L, Wl)] E'^F, consequently, 

W[h{Lit o 0„ (t/^'* o o ^„)F] > M[h{L, Wl)] WY. 



For every r > 0, we note 



sup 

s<r 



sup 

s<r 



and 



sup 

s<r 



'V' oL^t^oeu-U"' oL 



< 



\gi\\ooi\\g2\\ooJu + ^2,tl|-^5'2'^ll°o^) 

\fwiUnit 



0. 



Since /i is Lipschitz, therefore, 

h{L\ t/^'* o L*) - /i(L*. o e^, {u"^ oLl)o e^] 



t— >oo 



0. 



Since /i and Y are bounded, we deduce from Lebesgue's dominated convergence theorem 
that (4.23) holds for all bounded ^^-measurable random variables Y . 

Second, for arbitrary bounded .^-measurable Y , we have E'^fFj^a] — > F in £^ as 
M — cx). Consequently, again by Lebesgue's dominated convergence theorem. 



lim sup 

t>o 



E'' 



h{L\ U^" o L\ W o L*)(E"[r|^J - Y)] 



0. 



Thus, (4.23) holds in general. 
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By Corollary 4.11 and by eq. (3.5) of Hopfner et al. (1990), we obtain the following 
corollary to Lemma 4.13. 

4.14 Corollary. Grant Assumptions 2.2, 2.4, 3.1 and 3.2. Let rjf be as in Lemma 4-12. 
Then we have the following stable convergence in law in P(]R^^).- 

(S^.'^^ix.), ^r.'^*(x„ y.)),,,, ^ (/i'(x.)L, /i'(x,)a(x„ y^)Wl)^^^ , 

where a{x,yY is given by (3.4). □ 
Proof (of Theorem 3.7). For every t > and each x and y, we have 



I - X f^?A,' (a:, y) - P'ix, y)JvitV2t/vt It 9T'"iXM 

^JvtV,,tV2,t[}t {x,y) } (x,y)J- Sl%{x)+v^'f'^^jr{X,)ds 



s 



where f"^{x,y) := fi{gi'^Fg2'^)/ fi{gi'^). Let /;,„ : D — ^ R be as in the proof of Theo- 
rem 3.6. We recall ipihD < CVi,n ^^r some ( < oo. We also note ft?7^f — )• oo. In analogy 
to (4.17), thus. 

Since L and W are independent, V{xi,yi) := defines an /-dimensional 

standard Gaussian random vector such that L, V and ^ are independent. By the 
continuous mapping theorem and Corollary 4.14, consequently. 



\/vtvttVit(ft"ixi,yi) - f^'ixuyi))) (a{xi,yi)V{xi,y^)L^^^^) 

where a{x,yY is given by (3.4). o 
In addition, grant Assumption 3.3 and let rjt = {rji t, r72,t) be such that (3.3) holds as 
well. We abbreviate 'y^{x,y) = f^{x,y) — f{x,y) and note 

fJ'igVh'^i^, y) = + Viz) (f{x + r]iz, y + r]2w) - f{x, y))gi{z)g2{w)dwdz. 



We apply Taylor's theorem to fi' and /: In x, we expand up to the order \ai \ — 1 
and, in y, we expand up to the order [a2l ~ 1- We recall from (2.5) that gi and g2 
are, at least, of order ai and a2, respectively. By a classical approximation argument, 
therefore, there exists a constant ^ < oo such that \fi{gi'^)^^*{x,y)\ < CiVi^t + '72,?)- If 
Ci = C2 = in (3.3), then it is immediate that {vt■r]f ^■r]^^y/'^^'^'{x, y) -f 0. If tti, ^2 G N*, 
more explicitly, 

\mi+m2\=ai 
\m2\^0 
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Since — /i'(a;), we have ivtr]f^r]2^Y^'^'y'^*{x,y) 'jix,y) given by (3.6). □ 

Proof (of Corollary 3.8). In analogy to the proof of Theorem 3.7, by Corollary 4.14 it 
remains to show that {vtr]fir]2t)^^'^lT{Xiy) is a consistent estimator for 'y(x,y). 

We recall that in classical (conditional) density estimation, the (partial) derivatives 
of a consistent density estimator - provided they exist - are consistent for the (partial) 
derivatives of the estimated density. In analogy to Lemma 4.8, we observe that this is 
also true in our context. In particular. 



A f'nUr A f(r and ^0 dx^9i lA.jds p. Q^..^^{x) 

dx^^^dy^^^' ^ dx^^dy^'-^^^ ''^' '"(X,)ds fi'{x) ' 



If either ai,a2 G N* or (^i = ^2 = in (3.3), consequently, ivt'r]f^r]2iY^'^^t'' {x,y) — )■ 
7(x, y) in probability as t — 00. □ 



5. Proofs for results of section 2 

Throughout this section, ( < oo denotes some generic constant which may depend on 
the variables specified at the beginning of each proof. It may change from line to line. 

This section is organised as follows: First, in section 5.1 we study the influence of 
discretisation on our estimator. We prove results for the small-time asymptotic of Ito 
semi-martingales and for the sojourn time discretisation error. Second, in section 5.2 we 
prove an auxiliary, non-standard martingale limit theorem. Third, in section 5.3 we prove 
the consistency of our estimator (Theorem 2.9) utilising our results from sections 4.3 
and 5.1. Last, in section 5.4 we apply Theorem 5.5 from section 5.2 to our case and 
conclude with the final steps in the proof of the central limit theorem (Theorem 2.10 
and Corollary 2.11) utilising our results from sections 4.4 and 5.1. 

5.1. Small-time asymptotic and sojourn time discretisation error 

In this subsection, we study the influence of discretisation. 

We compare our estimators in Definitions 2.7 and 3.4: In the numerator of the former, 
the jumps AXt and the pre-jump left-limits Xt- are replaced by the increments A^X 
and the pre-increment values respectively. Our Ito semi-martingale meets the 

following small-time asymptotic: 

5.1 Proposition. Let A be a compact subset of E x E* , tjq < min{\\y\\ : {x,y) G A}, 
and let g be a twice continuously differentiable kernel with compact support. Grant 
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Assumptions 2.1 and 2.3. Then, for every m G N*, there exists C, < oo such that 



^(aAl)/2 



A 



2V(/3+d) 



E 2k 
k=l 'I / 



2(m+l)+d 



(5.1) 



holds for every {x,y) & A, rj < r]o and A < 1, where g^''^{w) = rj '^g{{w — y) /rf) . 

Remark. For presentational purposes, we have left a small gap in the finite activity case. 
For instance, if / is locally bounded on E x E, then we can improve the bound in (5.1) 
replacing ry2v(/3+d) ^2 independently of the dimension d. 

In the former estimator's denominator, the sojourn time Jq gi'^(Xs)ds is replaced by its 
Riemann sum approximation A X]fc=i 5'l'^(-^(fc-l)A)• 
5.2 Proposition. Let x & E, v : R+ — ?■ R+ be a non- decreasing function, ^„ > 0, 
r]n — 0, and (/i„,)neN* a uniformly bounded family of twice continuously differentiable 
functions supported on Brj„{x) such that {ri]^^d"^hn)ne¥i* is uniformly bounded for every 
multi-index m with \m\ G {1,2}. As nA — > oo and A — 0, we suppose v{nA)ri^ — )• oo 
and ^„Ar/-2-"'[(i-2/{/3+'i))vo] ^ g. 

(i) Grant Assumptions 2.1 to 2.3. If nA'^^n — > and v{s) = v{st) for some deter- 
ministic equivalent v of X and some t > 0, then, under any law P'^ , we have the 
following convergence in probability: 



sup 



<t v{nA)ri^ 



A /i„(X(,_i)a) - / hn{Xr)d 
k=i 



0. 



(5.2) 



(a) Grant Assumptions 2.1 to 2.4. If (nA)^ "^A^^ — t- and v is the regularly varying 
function from (2.4), then, under any law , (5.2) holds for all t > 0. 

Before we turn to the proofs of Propositions 5.1 and 5.2, we present two auxiliary 
upper bounds for the small-time asymptotic of Ito semi-martingales. Below, we heavily 
utilise results and notation from the books Jacod and Shiryaev (2003) (esp.. Chapter II) 
and Jacod and Protter (2012) (esp.. Section 2.1). 

We recall that our underlying process X is an Ito semi-martingale with absolutely 
continuous characteristics (5, C, n) satisfying (2.2), and that its jump measure m is the 
random measure on R+ x E given by m(dt, dx) := Z^IsiAx^t^o} f(s,AXs)(dt, dx). For a 
function g onVt x x E., we define the stochastic integrals 

g'kvc\.t:= I g{(jj,s,w)m{uj](\s,(\w) and (7*nj:= / g{(jj,s,w)n{uj](\s,(\w), 

J[Q,t\y.E J[Q,t\y.E 
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and also the purely discontinuous martingale g -k (m — n)t, as soon as these integrals are 
well-defined. By Levy-Ito and Grigelionis decomposition, we can assume w. 1. o. g. that 
there exists a (i- dimensional Wiener process W, defined on {^t)t>o, (P^)xeE), and 

a.n E ^ i?-valued function a with c = aa^ such that 

Xt = Xo+ f b{Xs)dt+ f a(X,)diy, + (wl|j^||<i)*(m-n)t + (wl||„,||>i)^mt. 
JO Jo 

Ito's formula plays a crucial role in the sequel. By a version derived from (2.1.20) of 
Jacod and Protter (2012), if (7 : — )• R is twice continuously differentiable, then 

g{Xt) = g{Xo) + l\{X,yVg{X,)ds + ^ |^*tr [c{Xs)VYXs))ds 
+ (^(X„ + w)- g{XJ) - w'Vg{XJ)) l|i^||<i ^n^ 
+ a{Xs)dWs + [g{X^ + w)- g{X 



(5.3) 



<i .^r - njt 



g{X^ + w)- 5((X_)]l||^||>i 



where tr(-) denotes the trace operator on E ® E and V'^g denotes the Hessian of g. 

For ,^ > 0, we denote by := inf{t > : || AXt|| > ^} the first time of a jump greater 
than ^ . Also, we introduce the following decomposition of our semi-martingale X: 

Xt = Xo + X^ + X;«, where X't^ := (wl||^||>^) * = AX,l|iAX,||>e 

s<t 

We note that X^ and X'^ are again Ito semi-martingales; we denote their characteristics 
by {B^,C,n^) and {B'^,0,n'^), respectively. Furthermore, we decompose X^ into drift 
B^, continuous martingale part M^, and purely discontinuous martingale part M^. These 
are given by 

S« = /%«(X,)ds, M^=fa{X,)dWs and Mf = {wl\\^\\<^) ^ {ra - n)t, 
Jo Jo 

where 6^ (x) = 6(x) — F(x, dw)w if ^ < 1, and b^{x) = &(x) F(x, dw)w 
if ,^ > 1. Under Assumption 2.1, we derive the following two lemmata. 

5.3 Lemma. Let ^0 > o.iT'd p > 2. Grant Assumption 2.1. Then, there exists a 
constant C, < 00 such that, for every 0<C,^^o,xEE, and t < 1, we have 

E^sup||xf^^,r <c(i + iixr)t. 

s<t 

Proof. In this proof, ( < 00 may depend on ^0 and p but neither on t, x, ^ nor 
First, let 1 < ,^ < ,^o- We emphasise that, in this case, 

m^)\\ < mx)\\+^'o^'F{x,{l < \\w\\<^o}). (5.4) 
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By (2.2), we have n«(dt, A) = dtF«(Xj, A) := dtF{Xt, A n B^{0)) for every Borel set A. 
By construction, X't^ = on {t < T«}. By (2.1.43) of Jacod and Protter (2012), thus, 

snp\\Xl^, r<CW [t^-i J^WbHXo + Xl^M'ds + t^/'~' j\c{X, + X^^.W^ds 
+ CW l^dsj F^"{Xo + Xl,dw)\\w\\P 

Under Assumption 2.1, for all t < 1, we observe 

E-sup||x;^^,r < c Ai + Eii^o + ^f,rcr)ds- 

s<t Jo 

For C' > 0, let S'^' := mi{s > : ||X||| > ('}. Then 

where we note supg<jX^^^^^^j, || <(' + By the Gronwall-Bellmann inequality, thus, 
E^sup||Xf^^,^^,,r < C(l + Ikr) ft + f\e<('-^Ms) = C(l + ||xr)(e^* - 1). 

s<t \ JO / 

Since S^' AT^ as C' ^ oo, consequently, E"^ sup,<i||Xf^y5||P < C(l + Ikf )t. 

Second, let < ^ < 1. We note that Xfl^^^i = {Xt - Xo)!^^^^ holds, and that X^ 
is continuous at outside the null set {||AX2-e|| = C,}. As < for all u, thus, 

SUp||Xj = SUp||(X, - Xo)l^<T€|| < SUp||(X, - Xo)ls<Ti|| = sup||X]^yi|| 



s<t 



s<t 



s<t 



almost surely. By case ^ > 1, consequently, E"* sup5<jX^^y^ p < ^(1 + ||x||^)t. □ 

5.4 Lemma. Let y and rjo < \\y\\ . Grant Assumption 2.1. Then, for every m G N*, 
there exists a constant ( < oo - non-increasing in \\y\\ - such that, for every x E E, 
r] < r]Q, and t < 1, 



F%XteBr,iXo + y)) 

<Cfl + ||a;|p("+^) + ||y||2('"+i) 



tri'^il + Y. 

\ k=l 



fc -2V(/3+d)-2(fc-l) 



-,2m 



(5.5) 



Proof Let 1 < C' < (||z/||/^o)^/^"'+^\ £ ■= (C""^^^o - C'""Vo)/Q > and ^ < e/2. In 

addition, let g he a, C^-kernel such that 1bi[o) ^ 9 ^ '^^(i'+i)/2i'^)- 9v(^) — 
g{{z — X — y)/r]) and abbreviate 

h{t,v) := F%Xt e B,{x + y)) < g,{Xt). 



32 



5. Proofs for results of section 2 



In this proof, ( < oo may depend on rjo, /3 and m, but neither on x, t nor 77. 
By Ito's formula (5.3), we have h(t,r]) < \Hi \ + \H'f^\ + \H'/^\, where 

:= W b{X,yS7gr,iX,)ds + ^ W tr (c(X,) V2(7,(X,))ds, 

:= E^^*dsls,,^(,+3,)(X,) I F(X„dw){(?,(X, + w) - -«7^V(7,(X,)l||^ll<i}, 
Hp ■.= M^ J^dslB^,^^.+yr{Xs) J F{Xs,dw)gr,{Xs + w). 

Under Assumption 2.1, b{z) and c{z) are bounded in norm by C(l + Moreover, 
the gradient and Hessian of g^j vanish outside i?(^/+i)^/2(x + and satisfy < C'?"^ 

and ||c?ij5'j,|| < Cv~^- Hence, 

m < C(l + ||x|r+ ||yf)r/-2E- /*l5^^,^^,,^/^(.+,)(X,)d.. 



For z G B(^'n{x + y), furthermore, 

dw7){(7,,(2 + - - w"Vg.,iz)l\\^i\<i} < ^J'^''^ / F{dw){l A ||u^f ). 



Therefore, 



Suppose that {H't'^l < ({1 + + \\yf){tr]'^ + t^r/"^) holds. Then, 
h{t, v) < C(l + Wxf + ||yf )tr/'^(l + tv-^^^'^) + ^^^^"''"'^"^"'^ r Hs, Cv)ds. 



^2 



By iteration, we obtain (5.5) after m steps. 

It remains to prove \H^^\ < (^{l + \\x\\^+\\y\\^){tri'^+t'^ri~^). Under Assumption 2.1 (iii) 
on the one hand, we have 

F{z, dw)gri{z + w) < C(l + \\z\\)ri'^ J f{y + x — z + riw)g{w)dw 

^ iai + \\x\\)v^ ifzeB,,{x), 
- \C(l + Ik + y\\)v'' if 2: e B^+^,r,{x + yy. 

For z G i?i+^/^(x + y)\ B(^'ri{x + y), on the other hand, we have 
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Since r]'^ < r] ^ and / F(dw)(||w||'' A 1) < oo by assumption, thus, 



F{z,dw)gr,{z + w) < \ , II ||w -f ^ T, f \ ^^-^^ 

C(l + \\x\\)v , if z e B3,{x). 



Let S''^ := inf{t > : > 3e}, and := {S''^ < T« A t}. We split the set 

nx[0,t] into Ai := fix |0, tAT? A5"'«|, A2 := (fif x [T? At, t] and A3 := fif^ x fS'''^ t]. 
Then we obtain the following: 

First: Since sup^<jX^^y5^^j_j — Xo|| < 3e, by (5.7), we obtain 

||^^dP^dsl5^,^(,+,)c(X,) J F{X,,dw)g,{X, + w)<ai + \\x\\)tv''. 

Second: Under Assumption 2.1, we have 

P=^(T« < t A < E^- r dslB3,(,.)(X,)F(X„ > < C(l + \M)t. 

Jo □ 

By the Markov property and (5.7), therefore, 

|^^dP"dslB,,^(,+,)c(X,) I F{X,,dw)gr,iXs + w) 

< E^- l{Te<tA5^.q E''^^ ^* dslB^,^(,+j,)c(X,) J F{X,, dw)g,{X, + «;) 



(5.8) 



< C(l + \\x + ylDtr]-^ P^(r« < t A S''^) 

< C(l + \\xf + ||yf )t^r/-^. 



Third: By Lemma 5.3, we have P^'(r2j'^) < ^(1 + ||x||^)t. By the Markov property and 
(5.7), therefore, 

JJ^ dP"dslB^,^(,+,)c(X,) I F{X„dw)gr,{Xs + w) 

< Cil + \\x + y\\)t7]~^ (5-9) 

< c(i + \\xf + \\yf)t'^v~'^- 

We turn to the proofs of Propositions 5.1 and 5.2. 

Proof (of Proposition 5.1). Let 1 < C' < (min{|||/|| : {x,y) G A}/?7o)^/*^'""'"^\ and e,^ > 
be given as in the proof of Lemma 5.4. In this proof, C < 00 may depend on r/o, C' 1 Pi 
m and the set A, but neither on x, y, A nor 77. 

Let Tj < riQ, and {x,y) G A. W. 1. o. g., we assume that g is supported on -Bi(O). To 
avoid cumbersome notation, we abbreviate /i^ = g^'^~^y. From (2.2) and Ito's formula 
(5.3), we obtain E^ hr,{XA) = HI + H2 + H^', where 



Hl = W [''b{XtyVh,{Xt)dt + \E^ Ttr (c(X,)V\(Xi))dt, 

JO ^ J 
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iJ2 = E" 



j\tlB^,^^^+y){Xt) J F{Xt,dw){hr,{Xt + W) - hr,{Xt) - w"Vh^{Xt)lll^ll<i}, 

dtlB^,^(x+yY{Xt) J F{Xt, dw)h^{Xt + w). 



By (5.6), we observe 



tt'V 

-"a 



< 



c 



4+2 



7]' 



['^F^{XteB^.,{x + y))dt. 

•J 



By the choice of C', Lemma 5.4 imphes 

"A2 / 



\Hl\ + 



tt'V 

-"a 



1 + E 



A 



m+l 



2V(/3+d)+2(fc-l) 



2(m+l)- 



V 



(5.10) 



Suppose 

- J F{x, dw)h^{x + w) V^{Xt ^ B^,.,{x + y)) 

< ((A^+("^^)/^ + A^r/"(^+'^)) 

Combining (5.10) and (5.11), we obtain (5.1). 
It remains to prove (5.11). By (5.7), we observe 

C, if 2 G B^,{x). 



(5.11) 



j F{z,dw)hr, 



'z + w) < 



(5.12) 



Let the stopping time S^'^, and the event fi^^ be given as in the proof of Lemma 5.4. 
We spht the set x [0, A] into Ai := n x [0, A A T« A S''% A2 := (fi^^)= x |T« A A, A] 
and A3 ■.= ni^x I^'^'^A]. For convenience, we also abbreviate 

fx,yi^^ ^) ■= /(^' y + x-z + r]w)- f{x, y + r]w). 
Then we obtain, first: By the choice of e, we have that the convex hull of the set 

{{z,y + {x — z) + rjw) : (x, y) E A,\\z — x\\ < 3e,\\w\\ < 1} 

is a compact subset of x E*. By Assumption 2.3 and for all {z,w) G B^^ix) x -Bi(O), 
we have \ f^^y{z,w)\ < (\\z — a;||"^"'^. By Lemma 5.3, therefore. 



dP^dt / dwg{w)fly{Xt,w) < CAE^sup||X^ 



t<A 



l+(aAl)/2 



Second and third: We compare (5.7) and (5.12). In analogy to (5.8) and (5.9), respec- 
tively, by the Markov property and (5.12), therefore, 

dP^dtl5^,^(.+,)c(X,) J dwg{w)fly{X,,w) < CA'7^-(^+'^\ 
for i G {2,3}. In summary, we proved (5.11). □ 
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Proof (of Proposition 5.2). W. l.o.g., we assume rj < 1/4. In this proof, ( < oo may 
neither depend on n, A nor 77. 
By Ito's formula (5.3), we observe 



VnAVn 



/ hn{Xr)dr - A 2^ /l„(X(fc_i)A 



where 



H: 



In ._ 



VnAV 



L«"J rkA ft f 1 / x\ 

^ E / / 6(X,)'V/i„(X,) + -tr (c(X,)V\„(X,)) dr. 



[snj 

E 



kA 



dt 



dr 



E. 



I In ._ 



/ di/;){/i„(X, + I/;) - /i„(X,) - z/;'V/i„(X,)}, 

J||to||<l 
Sri 



and 



E/ dt E l||Ax.||>i{/i„(X,_ + AX,)-/i„(X,_)}, 



Mr 



d E / / V/i„(X,) V(X,)dl^, 



+ r / {/i„(X,,_+w)-/i„(X,_)}(m-n)(dr,du;)l. 

J{k-l)AJ\\w\\<l J 

It remains to show: 

(i) Under Assumptions 2.1 to 2.3, if v{s) = v{st) for some deterministic equivalent v 
of X and some t > 0, and if nA'^^n 0, then H^, H'J^, H'J^ and M" converge to 
zero uniformly on {0 < s < t} in probability. 

(ii) Under Assumptions 2.1 to 2.4, if v is the regularly varying function from (2.4), 
and if (nA)i-^A^„ ^ 0, then H^, H'J", H'^"" and converge to zero uniformly 
for {0 < s < t} in probability for all t > 0. 

(a) Under Assumption 2.1, b{z) and c{z) are bounded in norm by C(l + IklP). More- 
over, the gradient and Hessian of hn vanish outside Br)^{x) and satisfy < Cr]~^ 
and < C^~^) by assumption. Thus, 

bizYVKiz) + Ut (c(z)VV(^))| < C(l + \\z\\)7^-'1b,a,){z). 

By Fubini's theorem, therefore, 

sup \H:\ < C(l + \\xf)^ST'^''^-, where '^'^ = ^ / lB,(.)(X,)dr. 

r<s T VnAV -^0 
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In case (i), we deduce from Lemma 4.8 that the family {=Sf(S'^ 



P^) : n e N*} is 



tight under Assumptions 2.2 and 2.3. As A^nVn'^ ~^ O5 sup^<i — ^ in probabihty. 
In case (ii), we obtain from Corollary 4.11 that S""'^'^" converges stably in law to a 
non-trivial process. As A^nVn'^ ~^ O5 sup^<^ — )• in probability for alH > 0. <> 
(b) Let C' > 1 ^iid K = 1 A 2/(/3 + (i). Under Assumption 2.1, we have 



F{z, dw){hn{z + w)- hn{z) - w^Vhn{z)} 

w\\<l 

rC(l + |k||)r/-2/||^,|I^^F(d^/;)||w|p, for <C'C 
< C(l + \\A\)Vn^^ /||.||<i F{dw)\\w\\^ for Ctn < Ik - xll < 1 + r/. 
I 0, else. 



(5.13) 



Again by Fubini's theorem, therefore, 
supli/n <C(l + ||a:||) { 



^gln,A,('ri'^ _|_ g/n,A,l+r;„ 



t<s 



nd+2 

In 



7]' 



d+Kl3 



In analogy to step (a), since A^„?7„^ '^^^ '^^ — j- 0, H!p — )■ uniformly on {0 < s < t} in 
probability in case (i); and for alH > in case (ii). o 
(c) In analogy to steps (a) and (b), we note 

|i/ri < ^nA{VnAvty\\hn{X^ + ^) | + |/l„(X_)|)l j|w)j|>l * ^lsn\A 



< \K:\ + |Ar-„j/J + \KT\ + liVj 



[snj / n I 



where 



s 

K 

n: 

NT 



^nA{VnA'nn)~^\hn{X^)muj\\>l -k nisn]A, 
^nA{VnAvt)~^\hn{X- + W)m^\\yi ^ (iTT - n)snA, 
^„A(w„A?7n)"^|/in(A:_)|l||.u,|l>l * (m - n)snA- 



Under Assumption 2.1, since i||i„||>i F{z, dw)\hn{z + w) \ = for z ^ Bi^2rf„{x), we have 

F{z,dw)\K{z + w)\ <C(l + ||x||). 



i«||>i 



In both cases (i) and (ii), therefore. 



sup|JCI<C(i + IN'^'''^'^" 



s<t 



VnA 



for all t > 0. Furthermore, we observe that A^" is a martingale w. r. t. the filtration 
(=^snA)s>o- Its predictable quadratic variation satisfies 



'-\hn{X_+w)\'l 



'nA 



2. ^ II n^S'^^il 

«,||>i^n,„A < C(l + Ikll) , . 



0. 
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Since [snj/n — s, Nl^gn\/n ~^ uniformly on {0 < s < t} in probability for all t > 0. 
In addition, we recall that F{z, {\\w\\ > 1} < C{1 + under Assumption 2.1. Thus, 

sup \KT\ < C(l + ||x||)e„A5r'^* ^ 

s<t n-Kxi 

in case (i); and for all t > in case (ii). Again, we observe that A^'" is a martingale 
w. r.t. the filtration {^snA)s>o- Its predictable quadratic variation satisfies 

(AT'", AT'"), = ^^|/i„(X_)|2l||^||>i^n,„A < +11 ^n g>nA,V. y 0. 

Thus, A^'"„j/„ — )• uniformly on {0 < s < t} in probability in case (i); and for alH > 
in case (ii). o 
(d) Let (M^")s>o and (M"'^)cj>o denote the ^snA-martingales given by 

C fsnA 
Sn 



f- psnl\ 



VnAVn -^0 

:= ^^<^A(r)(/in(X_ + w)- /i„(X_))l||^||<i ^ (m - n),„A, 
where v^a(?') := A — (r — [r/AJ A). The predictable quadratic variation of M'" satisfies 
(M'^M"^), = ^^;^ r y.A(r)'V/i„(X,)^c(X,)V/i4X,)dt 

< C(l^NnA^.'n,A* 

As A^nVn'^ ~^ '^nAvt. C)0, M^" — > Uniformly on {0 < s < t} in probability in 
case (i); and for all t > in case (ii). 

In addition, the predictable quadratic variation of M"" satisfies 



(M"",M""), = -^^^^(r)''ih4X.+w)-h4X.))%^ii<,^nsnA 

'"nAVn 
A2t2 ^snA 



< 



VnAV?f 



/ dr / F{Xr, dw){hniXr + w) - hn{Xr)f 

Jo J\\w\\<l 



Let C' > 1 and K = 1 A 2/(/3 + d) be as in step (b). By (5.13), 
F{z,dw){hniz + w) - hniz)f 



w\\<l 



C(l + |k||)r/-2/||^ll<,F(dw)||w||2, for <«, 
< U(l + IklD^r" /||^||<i F{dw)\\wf, for C'< < \\z - a;|| < 1 + r/„, 

else. 
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Therefore, 



q>n,A.l+ri„ \ 



Again since A^nVn^ ^'^^ '^^ ~^ ^s"" ~^ uniformly on {0 < s < t} in probability in 
case (i); and for all t > in case (ii). □ 

5.2. Auxiliary martingale limit theorem 

The theorem presented in this subsection serves as a preliminary result for the proof of 
our central limit theorem (Theorem 2.10 and Corollary 2.11). It is a non-standard limit 
theorem for a triangular, martingale array scheme. 

Here, we work on the extension (2.11) of the probability space, L denotes the Mittag- 
Leffler process of order < 6 < 1, and W = {W^)i^i denotes an /-dimensional standard 
Wiener process such that L, W and ^ are independent. 

5.5 Theorem. For n G N*, let (^")s>o he the filtration given by := <^isn\A, o.nd I 
be a finite index set. Moreover, let hn '■ E x E ^ be such that ||/in||oo ^ as n ^ 00. 
Grant Assumptions 2.2 and 2.4, and suppose that the process M"' given by 

lsn\ 

M;:=5:/.„(X(fc_i)A,A^X) (5.14) 

k=l 

is a ^^-martingale such that the predictable quadratic co-variation {M"'^ , M'^^) is iden- 
tically zero for every i ^ j and all n large enough. If {{M^'^ , M'^^))i(zj converges stably 
in law in V(Il^) to (^,?L)jg/, then 

(Ql^Die/. 

n— s>oo 

Proof. Let 6=1. Then we have Lg = s. Therefore, the convergence of to (<^f Vr*)jg/ 
follows directly from standard results (see section VIII. 3c of Jacod and Shiryaev (2003)). 

For the remainder, let < 5 < 1. We consider the processes L"*, L", K"' and iV" 
given by 

[sn\ 
k=l 

L::=Y.L:\ K::=mi{s>0:L:>u} and K := M^„. 

We emphasise that iV"(L^) = M," + AM;^„(^„^ holds for all s. As || AAf"|| < ||/i„|| 0, 
it is sufficient to prove that we have the following stable convergence in law in D(Rx R^): 

{L\N-) (fL, ' ^^^^^ -■=T.^"- (5-15) 
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First, by the continuous mapping theorem, we obtain 

Second, we remark that is a predictable 5f "-stopping time for all m > 0. Thus, A^" is 
a martingale w. r. t. the time-changed filtration JifJ^ := Moreover, we observe that 
its predictable quadratic variation satisfies 

By (5.16), we have that |L"* — (^j/^)^L"| — )• uniformly on compacts in probability for all 
i E I. We note that the (scaled) Mittag-Leffler process q^L is a. s. continuous. Its right- 
inverse K given by '■= inf{s > : ^^Ls > m} is a (deterministically time-changed) 
(5-stable Levy process, hence, without fixed time of discontinuity. By (3.2) of Hopfner 
et al. (1990), therefore, — j- [qi/q^s in law for every s > 0; hence, in probability. By 
construction, we have that ||AA^"|| is bounded above by ||/i„||oo- This bound converges 
to zero. By standard results (see above), consequently. 

In analogy to the proof of Lemma 4.12 and Steps 6 and 7 on pp. 122-124 of Hopfner 
et al. (1990), we obtain that the pair (L",iV") converges in law in P(R x R-^) to 
(^^L, [{c;i/q)W^)^^^). Finally, the stable convergence in law and the independence from 
^ follows in analogy to Lemma 4.13. □ 



5.3. Proof of Theorem 2.9 

Throughout the remainder of section 5, we work under the law P'^ for some inital 
probability vr on and we denote := {x E E : fi'{x) > 0, F{x, E) > 0}. 
We consider the processes G""'^'^ and R"-'^'^ given by 

g:'^^\x, y) := — e 9r{x^k-i)A)9r{A'',x), (5.18) 

VnA k=l 

R^'^'^{x):= — Y.gnX^k-i)A). (5.19) 

VnA 

5.6 Lemma. Grant Assumptions 2.1 to 2.3. Let rjn = ?7i,„ be such that (2.7) holds, and 
let X G E(Q . 

(i) IfnA^ 0, then, 

the family [S£ (i?"''^'''" (x) | P^) : n G N*} is tight. (5.20) 
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(a) Grant Assumption 2.4 in addition. If {nAy A 0, then, (5.20) holds as well. 



In both cases, each limit point of the family in (5.20) is the law =Sf for some 

positive random variable L. 



nA,Ti„ . 



X) 



Proof Let := v^^ g:i'''{Xr)dr. By Lemma 4.8, the family {^{S'l 

P^) : n e N*} is tight; moreover, each of its hmit points is the law ^{fi'{x)L) for some 
random variable L > 0. In both cases (i) and (ii), since rjn is such that (2.7) holds, we 
have 



nA,rin I 



X) 



> 



by Proposition 5.2. Consequently, the family {=Sf | P'^) : n G N*} is tight; 

moreover, each of its limit points is a limit point of the family {^[Si^{x) \ P'^) : t > 0}, 
hence, the law ^{^'{x)L) for some random variable L > 0. □ 



5.7 Lemma. Grant Assumptions 2.1 and 2.3. Let rjn = {vi,ni V2.n) be such thatrji n — > 0, 
V2,n and Ar/^^^^^+'^^ 
with compact support. Then 



V2,n — ^ and Ari2^^'^^'^^ — t- 0. Moreover, let (x, y) G -E© x E* , and let g be a C"^ -function 



lim sup 

2e-S'7i,„{^) 



1 g^-'yiA-,X) - fix, y) j g{w)dw 



0. 



(5.21) 



Proof. First, by Proposition 5.1 - where we choose m large enough - we have 



lim sup W g^'^'iA'lX) - Fg 



0. 



Second, under Assumption 2.3, / G C^ci^ ^ ^*) for some a > 0. Therefore, 



lim sup Fg'^'"y{z) - Fg 



aM 



ze-Br,i^„(a:) 

Third, by Lebesgue's differentiation theorem, we observe 



0. 



lim 

n— j-oo 



Fg'^-^y{x)- f{x,y) / g{w)dw 



5.8 Lemma. Grant Assumptions 2.1 to 2.3. Let rjn = {f]i^nifl2,n) be such that (2.1) 
holds. Moreover, let {x,y) G E^ x E* . Then, in both cases as in Lemma 5.6, 

the family (^"'^'''"(x, y), i?"'^'"'" (x) | P") : n G N*} is tight. (5.22) 

Moreover, each limit point of the family in (5.22) is the law ^{f{x,y)fi'{x)L,^'{x)L) 
for some positive random variable L. 
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Proof. We note that y) = f{x, + //^'^■''(x, y) + M;'^'''(x, y), where 

[snj 



1 '-""J , . 

H^'^'^ix, y) = — E anXik^DA) (e^(-^)- [^?2"'^(A^X)] - A/(x, y)) , (5.23) 
1 L™J 

M;'^'^(x,y) = — J2gr{X^k.r)A) (gn^X) -E''^^~^^-[gr{A-X)]) . (5.24) 

VnA k=l ^ ' 

By Lemma 5.6, it is sufficient to prove that iff '^'''"(x, y) and '^'''"(x, y) converge to 
zero in probabihty as n — )■ oo. 

(E) We observe 

n 

i^r'^''(x,2/)|< sup |A-^E^[(7r(AW]-/(^,2/)hniE^^'''(^(fc-i)A)' (5-25) 

where /i is a C^-function dominating \g\\. The sequence (f ~^ Z]fc=i A/i''"''^(X(fc_i)A))neN* 
is tight in analogy to Lemma 5.6. As sup^g^^^ ^(^)|A-i E''[5(^"'''(A^X)] - j{x,y)\ 

by Lemma 5.7, we have Hi'^'^"(x,y) — ?■ in law, hence, in probability. <> 
(M) We observe that M"'^''' is an .^LsnjA-martingale. We note sup^<;^ || AM"'^'''" || < 
(i'nA'7i,„'72,n)~"'^llfi'illoo||5'2||oo " > by (2.7). By Theorem VIIL2.4 of Jacod and Shiryaev 
(2003), thus, it is sufficient to show that the predictable quadratic variation of M"'^'''" 
at time one, denoted (M"'^'''", M"''^'^")i, converges to zero in probability. 
We observe 



n 
k=l 



By Lemma 5.7, sup,gB^^^(,)|A-i E^r/^^^^?^ '"(A^X)^! ^ f{x,y) J gi{wYdw. In analogy 

to step (H), since VnAVinV2n ~^ ^> have (M"''^'^", M"''^'^")i — > in law, hence, in 
probability. □ 

Proof (of Theorem 2.9). We recall the results from Lemma 5.8. Let L > be a random 
variable such that the law ^ {f {x , y) fi' (x) L , fi' (x) L) is a limit point of the family in 
(5.22), and let {nk)k£¥s* be a sequence such that 

(Gr'^*'=(x,y),i?r'^*'=(x)) ^ (/(x,y)/.'(x)L,/i'(x)L) . 

Since /i'(x) > 0, by the continuous mapping theorem, we conclude 

As this limit is unique and independent of the particular limit point of the family in 
(5.22), we have that ff^^^x.y) converges to f(^x,y) in law, hence, in probability. □ 
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5.4. Proofs of Theorem 2.10 and Corollary 2.11 

Throughout this subsection, we work on the extension (2.11) of the probabihty space, 
L denotes the Mittag-Leffler process of order < 5 < 1, and W = (iy*)jg/ denotes an 
/-dimensional standard Wiener process such that L, W and ^ are independent. 

We consider the processes G""'^'"^ and R"-'^^^ given by (5.18) and (5.19), and the pro- 
cesses ^"'^'^ and i?'"'^'^ given by 

f/r'^-n^, y) ■■= (g:'^'^^, y) - ^^^J^Ifp i^r-^'n^:)^ (5.26) 

K'^'^^ix) ■.= — Y: vfgnX^^k-DA?. (5.27) 

We recall that, under Darling-Kac's condition, we have TheoremeS of Touati (1987) at 
hand (see Proposition 4.10). First, we obtain an extension of Lemma 5.6. 

5.9 Lemma. Grant Assumptions 2.1 to 2.4- Let rjn = r]i^n be such that (2.7) and (2.9a) 
hold, and let {xi)i,zj be a family of pairwise distinct points in E^. If (nA)-'^~''A — 0, 
then, under any law IP^ , we have the following stable convergence in law in V{R^^): 

(i?"'^*(x,),/2''^'^*(x,))^^^ g| (/i'(x,)L,/i'(a;,) J g^iwfdwL^ (5.28) 

Proof Let Sl'^^ix) := v^^ J^' g^{''{Xr)dr and 5^*'''(x) := t;rVoSV^(^r)^dr. We note 
that M"'"") /i'(x) and /i(r/^(^i"''')^) -> f^'i^:) J giiw^dw for all x. By Theorem 4.2 
and Proposition 4.10, we deduce - in analogy to Corollary 4.14 - that 

(5"^*(x,),^'"^*(x,)). ^f^' f/i'(x,)L,/x'(x,) / g2{wfdwL) . 

For every x, moreover, we deduce from Proposition 5.2 that 



^ 0. 



Consequently, we obtain (5.28). □ 
In view of Theorem 5.5, we obtain the following preliminary result. 

5.10 Lemma. Grant Assumptions 2.1 to 2.5. Let rjn = {'r]i,n,V'2,n) be such that (2.7) 
and (2.9) hold, and let (xj, yi)iei be a finite family of pairwise distinct points in E^q x E* . 
If (nA)^"'^A — )• 0, then, under any law ¥^ , we have the following stable convergence in 
law in V{W): 

(i?"'^*(x,),[/"'^*(x„2/,))^^^ {fi'{x,)L,a{x,,y,)fi'ix,)Wl)^^^, (5.29) 
where a{x,yy is given by (2.12). 
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5. Proofs for results of section 2 



Proof. Let (W^)s>o be given by = ^[snjA, and let the process M"'^'^ be given by 



I d d" L*"J 
VnA 



We note that M"'^'^ is a ^J^-martingale of the form (5.14). The proof is divided into 
four steps: First, we prove 



n,A,r)n 



ucp 



0. 



(5.30) 



Second, we show that the predictable quadratic variation of M^'^'^{x,y) satisfies 



((M"'^*(x„|/,),M-'^*(x,,i/,))) ^ ([a(x„y,)/i'(x,)]2L 



(5.31) 



in V(R^). Third, we show that (M"'^'''"(xj, yj), M"'^'''" (xj, ?/j)) vanishes for all n large 
enough if i 7^ j. Last, we argue 

(i?-'^*(x,), (m-'^*(x,, y,), M"'^*(x,, y,)))^^^ ^ (/i'(x,)L, [a(x,, y,)/i'(a:^)]'^)^^^ 

in D(R^^). By Theorem 5.5 and (3.5) of Hopfner et al. (1990), we then have (5.29). 

(i) We note U^'^''^{x,y) - M^'^'^^ix.y) = //^'^'^(x, y) + //f '^■''(x, y) with 



A 



lsn\ 



H:'^'^{x,y) := ^v^^r^fr^i— J] ^7i(X(,_i)a) F.^^ (^(.-i)a) - 



VnA 



k=l 



gi ^ g2 



r),x\ 



\K'''^\x,y)\<^VnAvH sup 



Wgr{AlX)-Fgr{z) 



where R!'J^'^''^{x) = A?;"^, Sitl'i ^'''^(^(fc-i)A) for some C^-function h, dominating \gi\. 
Under Assumption 2.5, Fg2'^ is twice continuously different iable. Since (2.9) holds, by 
Proposition 5.2 and step (i) in the proof of Lemma 4.9, H"''^'^"{x,y) ^ in ucp as 
n — )■ 00. By Proposition 5.1 - where we choose m large enough - we have 



sup 

zeBn-i^{x) 



-Wgr{A-^X)-Fgr{z] 



since (2.9a) holds. Since, moreover, (2.9) holds, therefore, 

1 



VnAVtnVin SUp 

zeBr,^ (x) 



Wgr''{A'lX)-Fgr''{z] 



0. 



(5.32) 



In analogy to Lemma 5.9, R""''^'^"{x) converges stably in law. Thus, \H'J^'^''^{x,y) \ =^ 
in ucp as n — 00. Consequently, (5.30) holds. o 
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5. Proofs for results of section 2 



(a) We note (M"'^'^(x, y), M"'^'^(x, y)), = K^^^''^{x,y) - /sTf '^•''(x, ?/), where 



d d L'"^J 



and 



li^f'^'^a;,?/)! < sup 



x). 



By Lemma 5.7 and the continuous mapping theorem, 



sup 



By Lemma 5.9, R'^'^'^"'{x) converges stably in law. Since A?72„ — ?■ 0, we observe that 
\K'J^'^'^"{x, y)\ converges to zero uniformly on compacts in probability as n — )• oo. 
Again by Lemma 5.7, 



sup 

2G-B^i ^(a;) 



ak E^(-.)A g^-y{A-^Xf - fix, y) I g2iwrdw 



-> 0. 



ucp 



In analogy to K''^'^'^{x,y), therefore, 

- fix,y) J giiwfdwR'^'^^^"{. 
By Lemma 5.9, consequently, 

(i^"'^*(a:„y.))^^^gg J gr{w)'dwf,'{x.) J ^?2(^)Mzl) 



(5.33) 



G/ 



hence, (5.31) holds. ^ 
(in) Let i,j G /. We note that for all n large enough such that r)i^n,V2,n are small 

enough, we have g^^'^^ g^"'^^ = Q whenever Xi ^ Xj, and (?2"'^'5'2"'^^ = whenever yi ^ yj. 

For all uj and n large enough, thus, (M"'^'''"(xj, |/i), M"'^'''"(xj, ?/j))s = if i 7^ j. ^ 
(^TO^ By Lemma 5.9 and (5.33), we obtain the joint convergence of (i?"''^'''"(xi))jg7 and 

(M'"'^'''"(xi, yi), M"'^'''"(xi, yi)))iei to the required limit. □ 

Proof (of Theorem 2.10). For every n, and {x,y) G -E© x we have 



n,A,rj„ 



x,y) 



Rl'^'^'-ix) 



where f'^{x,y) := ^{g^'^Fg2^)/ fi{gi'^). Since L and IV are independent, V{xi,yi) := 

— 1/2 

-^1 defines an /-dimensional standard Gaussian random vector such that L, V and 

are independent. By the continuous mapping theorem and Lemma 5.10, consequently. 



■JnAvtnVin (/^'''" (a^*, l/i) - (a^i, ?/i)) £A (a(x„ 1/,) (x^, ?/i)Li ^ 



iG/ 
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A. On the auxiliary Markov chains Z and Z' 

where a{x,yY is given by (2.12). 

In addition, let r}n = {r]i,n,V2,n) be such that (2.8) holds as well. It remains to prove 
{unAVinV2ny^'^if^"i^^y) ~ fi^iV)) ^ li^iV)- This, however, follows in analogy to the 
proof of Theorem 3.7. □ 

Proof (of Corollary 2.11). In analogy to the proof of Theorem 2.10, by Lemma 5.10 it 
remains to show that {vnAVi nV2 nY^'^^rr i^^ v) ^ consistent estimator for 7(x, y). This, 
however, follows in analogy to the proof of Corollary 3.8. □ 

A. On the auxiliary Markov chains Z and Z' 

In this appendix, we derive an explicit representation for the transition kernel $ of the 
auxiliary process Z\ and (in-) equalities for expectations of the form E^(/q^^ h[Xs)(lsY . 
In addition, we derive representations for the stationary probability measures ip and 
of the processes Z and Z' . 

We invoke technical results on resolvents of semi-groups. The resolvent {R\)x>o of 
a semi-group {Pt)t>o is given by Rx '■= exp{—\t)Ptdt. For bounded measurable 
functions h, the generalised resolvent kernel Rh is given by 

Rh{x,A) := E^' /°°e-/o'^(^^)d^l^(Xi)dt VxeE,Ae<ff. 
Jo 

These kernels were first introduced by Neveu (1972). For a comprehensive interpretation, 
we refer to section 4 of Down, Meyn, and Tweedie (1995). 

A.l Lemma. Let (-Ra)a>o be the resolvent of X, and let (_R^)a>o be given by 

CO ^ 

Rl ■■= Rx E ((I? - \^)Rx) > -^here I,/i(x) := q{x)h{x). (A.l) 

Then {Rx)x>o is the resolvent of a positive contraction semi-group. For its corresponding 
process X* , we have that the laws o/A*1|o,Ti[ a^c? A1|o,Ti[ a'^e equal. 

Proof. Since Igll is a bounded kernel, (-R^)a>o is the resolvent of a positive contraction 
semi-group by Theorem 4.2 of Bass (1979). It follows from Sawyer (1970) and Chapter 6 
of Bass (1979) that, for the process X* (corresponding to {R])x>o)i we have that the 
laws of X*l|o,Ti[ and X1|o,Ti[ are equal. □ 

A. 2 Lemma. Let h he a measurable function on E. Then 

h{Z[) = R*glgh{x) and W J^' h{Xs)ds = Rlh{x), (A.2) 

where R* denotes the generalised resolvent kernel associated with the modified resolvent 
{R*x)x>o and the function q. For every Xq > \\q\\^, we have i?* = ET=o Rl.O-x.-qRl^)'' ■ 
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A. On the auxiliary Markov chains Z and Z' 



Proof. We recall that the laws of X*1|o_Ti[ and X1|o,Ti[ are equal. The expectation 
of h{Z[) under P^, therefore, coincides with the expectation of h{X*) sampled at an 
independent killing time according to the multiplicative functional exp(— /q g(X*)ds). 
In formulas, we have 

Wh{Z[) = E^' r e--fo'^'-^*^'^'q{X*)h{X*)dt. 
Jo 

By eq. (19) of Down et al. (1995), hence, W h{Z[) = R^lghix), where denotes 
the generalised resolvent kernel associated with the modified resolvent {R\)x>o- By 
Chapter 7 of Neveu (1972), i?* = ET=o Rl.O-x.-gRV" holds for every A, > ||g||oo. 
Similarly, we observe 

h{Xs)ds = W r e~/o «(^i^)'i«g(x;) /* h{X:)dsdt. (A.3) 
Jo Jo Jo 

By Fubini's theorem (cf., eq. (20) of Down et al. (1995)), consequently, 

E^ h(Xs)ds = E^' r e-^o'^^^s)ds^x*)dt = Rlhix). 

Jo Jo ^ □ 



Remark. It is immediate from Lemma 4.3 that $ = IlR*lq. 
We obtain two corollaries: 

A.3 Corollary. Let hi, . . . ,hk be measurable functions on E. Then 



^ nil ^ roc / N /'*' 

E^- n / h,{X,)ds = E / e-/o "(^^^'^^hjiXn n / hiiX:)dsdt. (A.4) 



Proof. In analogy to (A.3), we observe 

k 



E^ 



n / ' h,ix,)ds = r e-/o^^^^)d-g(x;) n / h,{x:)dsdt. 

f=l "'0 -^0 Jj{ Jo 

By the Leibniz rule, moreover, 

n fh,{x:)ds = j2fh,{x:)Y[ rhi{x:)drds. 

By Fubini's theorem, therefore, we have (A.4). □ 

A.4 Corollary. Let h be a bounded measurable function on E. For all k G N*, if 

inf^.esupp(ft) q{x) > 0, then 



w(rh{X.,)ds] < ,,,^, R;\h\{x). (A.5) 

V-'o / (inf^esupp(/i)'?(a;))'^ ^ 
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Proof (by induction) . By Lemma A. 2, we immediately have (A. 5) for k = 1. We assume 
that (A. 5) holds for some /c G N*. Then we deduce from Corollary A. 3 and \h\ < 
q\\h\\oo/{inixesnpp(h) q{x)) that (A. 5) holds for k + 1. □ 

A. 5 Lemma. nlgllR* = fi. 

Proof. By Theorem 4.2 of Bass (1979) and Section 7 of Neveu (1972), we have 

(i,n - (I -i?r'))i?; = I, 

where the formal inverse of Ri is defined by R^^ := Er=o(I - ^i)^- 

Since is invariant 

w. r. t. {Pt)t>o, we also have fiRi = jj, and /i = fiR^^. Hence, ^Igli = fi{lqTl — (I — -Rf^))- 
Therefore, filgllR* = /i. □ 

A. 6 Corollary. The measures ip := {ix{q))~^^\q and tjj := are the invariant proba- 
bility measures w. r. t. $ and . 

Proof Since $ = IiR*glq, we observe yulq$ = filg. By (4.10), = cp^'''^ = Lp"^ in 

addition. □ 
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